GTP-BINDING ASSOCIATED PROTEINS
TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of GTP-binding associated
proteins and to the use of these sequences in the diagnosis, treatment, and prevention of immune system,
reproductive, nervous system, and cell signaling disorders, and cell proliferative disorders including
cancer.

BACKGROUND OF THE INVENTION

Guanine nucleotide binding proteins (GTP-binding proteins) are present in all eukaryotic cells 
and function in processes including metabolism, cellular growth, differentiation, signal transduction, 
cytoskeletal organization, and intracellular vesicle transport and secretion. In higher organisms they are 
involved in signaling that regulates such processes as the immune response (Aussel, C. et al. (1988) J.
Immunol. 140:215-220), apoptosis, differentiation, and cell proliferation including oncogenesis
(Dhanasekaran, N. et al. (1998) Oncogene 17:1383-1394). 

The superfamily of GTP-binding proteins can be subdivided into groups such as translational 
factors, heterotrimeric GTP-binding proteins involved in transmembrane signaling processes (also 
called G-proteins), proto-oncogene Ras proteins, other low molecular weight GTP-binding proteins
including the products of rab, rap, rho, rac, smg21, smg25, YPT, SEC4, and ARF genes, and tubulins
(Kaziro, Y. et al. (1991) Annu. Rev. Biochem. 60:349-400). 

GTP-binding proteins are involved in protein biosynthesis and include initiation factor 2 (IF-2),
elongation factor Tu (EF-Tu), and elongation factor G (EF-G), observed in prokaryotes; and initiation
factor 2 (eIF-2), elongation factor 1α (EF-1α), elongation factor 2 (EF-2), and release factor 3 (eRF3)
observed in eukaryotes (Kaziro, supra). IF-2 promotes the GTP-dependent binding of the tRNA to the
small subunit of the ribosome, the step that initiates protein translation. Elongation factors promote the 
binding of tRNA and GTP and the displacement of GDP after hydrolysis as protein biosynthesis 
proceeds. eRF3 participates in the recognition of stop codons and the release of nascent proteins from 
ribosomes. 

Heterotrimeric GTP-binding proteins are composed of 3 subunits (α, β, and γ) which, in the
resting state, associate as a trimer at the inner face of the plasma membrane. Heterotrimeric
G-proteins may be classified based on the sequence similarity of α subunits into the Gs, Gi, Gq and G12
subgroups. In the resting state, the α subunit binds guanosine diphosphate (GDP), and stimulation of
the G-protein by an activated receptor leads to exchange of GDP for guanosine triphosphate (GTP).
This exchange results in the separation of the α from the β and γ subunits, which remain tightly
associated as a dimer. Both the α subunit and β-γ dimer are then able to interact with effectors, either
individually or in a cooperative manner. The intrinsic GTPase activity of the α subunit hydrolyzes
the bound GTP to GDP. This returns the α subunit to its inactive conformation and allows it to
reassociate with the β-γ complex, thus restoring the system to its resting state (Kaziro, supra). Some α
subunits show tissue-specific expression indicating a specialized signaling role (Dhanasekaran, supra).
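
For illustration only, the activation cycle described above can be sketched as a small two-state model in Python. The class name, the method names, and the reduction of the cycle to a bound nucleotide plus a trimer flag are assumptions made for this sketch and are not part of the disclosure; effector interactions and reaction kinetics are not modeled.

```python
from dataclasses import dataclass


@dataclass
class HeterotrimericGProtein:
    """Toy model of the alpha-subunit nucleotide cycle (hypothetical names)."""

    bound_nucleotide: str = "GDP"   # resting state: alpha subunit binds GDP
    trimer_assembled: bool = True   # resting state: alpha, beta, gamma form a trimer

    @property
    def is_active(self) -> bool:
        # The alpha subunit is treated as active only while GTP is bound.
        return self.bound_nucleotide == "GTP"

    def receptor_stimulation(self) -> None:
        """An activated receptor drives exchange of GDP for GTP."""
        if self.bound_nucleotide == "GDP":
            self.bound_nucleotide = "GTP"
            # Alpha separates from the beta-gamma dimer.
            self.trimer_assembled = False

    def hydrolyze_gtp(self) -> None:
        """Intrinsic GTPase activity converts bound GTP back to GDP."""
        if self.bound_nucleotide == "GTP":
            self.bound_nucleotide = "GDP"
            # Alpha reassociates with beta-gamma, restoring the resting state.
            self.trimer_assembled = True


g = HeterotrimericGProtein()
g.receptor_stimulation()
state_after_stimulation = (g.bound_nucleotide, g.trimer_assembled, g.is_active)
g.hydrolyze_gtp()
state_after_hydrolysis = (g.bound_nucleotide, g.trimer_assembled, g.is_active)
```

In this sketch, one call to each method takes the model once around the cycle and back to the resting, GDP-bound trimer.
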
The α-s class of G-protein subunits is sensitive to ADP-ribosylation by pertussis toxin which
uncouples the receptor:G-protein interaction. This uncoupling blocks signal transduction to receptors
that decrease cAMP levels. cAMP levels regulate ion channels and activate phospholipases. The
inhibitory α-i class is also susceptible to modification by pertussis toxin, which prevents α-i from
lowering cAMP levels. Two novel classes of α subunits refractory to pertussis toxin modification are
α-q, which activates phospholipase C, and α-12, which has sequence homology with the Drosophila
gene concertina and may contribute to the regulation of embryonic development (Simon, M.I. (1991)
Science 252:802-808).

The mammalian G-protein β and γ subunits, each about 340 amino acids long, share more than
80% homology. The β subunit (also called β-transducin) contains seven repeating units, each about 43
amino acids long. This WD-repeat, or G-beta repeat motif, is found in a variety of proteins with
regulatory function such as Sec13, a yeast WD repeat protein involved in vesicular traffic; coronin-2,
a mammalian WD repeat protein involved in regulation of the actin cytoskeleton; and Bop1, a
mammalian WD repeat protein involved in growth suppression (Garcia-Higuera, I. et al. (1998) J.
Biol. Chem. 273:9041-9049; Okumura, M. et al. (1998) DNA Cell Biol. 17:779-787; Pestov, D.G. et
al. (1998) Oncogene 17:3187-3197). The activity of the β and γ subunits may be regulated by other
proteins such as calmodulin, phosducin, or the neural protein GAP 43 (Clapham, D.E. and E.J. Neer
(1993) Nature 365:403-406). The β subunit sequences are highly conserved among species, suggesting
that they perform a fundamentally important role in the organization and function of G-protein linked
systems (Van der Voorn, L. and H.L. Ploegh (1992) FEBS Lett. 307:131-134).

Mutations and variant expression of β-transducin proteins are linked with various disorders.
Mutations in LIS1, a subunit of the human platelet activating factor acetylhydrolase, cause
Miller-Dieker lissencephaly. RACK1 binds activated protein kinase C, and RbAp48 binds retinoblastoma
protein. CstF is required for polyadenylation of mammalian pre-mRNA in vitro and associates with
subunits of cleavage-stimulating factor. Defects in the regulation of β-catenin contribute to the
neoplastic transformation of human cells. The WD40 repeats of the human F-box protein βTrCP
mediate binding to β-catenin, thus regulating the targeted degradation of β-catenin by ubiquitin ligase
(Neer, E.J. et al. (1994) Nature 371:297-300; Hart, M. et al. (1999) Curr. Biol. 9:207-210).

The γ subunit sequences are more variable than those of the β subunits. They are often
post-translationally modified by isoprenylation and carboxyl-methylation of a cysteine residue four amino
acids from the C-terminus. These modifications appear to be necessary for the interaction of the β-γ
dimer with the membrane and with other GTP-binding proteins. The β-γ dimer has been shown to
modulate the activity of adenylyl cyclase isoforms, phospholipase C, and some ion channels. It is
involved in receptor phosphorylation via specific kinases and has been implicated in the
p21ras-dependent activation of the MAP kinase cascade and the recognition of specific receptors by
GTP-binding proteins (Clapham and Neer, supra).

G-proteins interact with a variety of effectors including adenylyl cyclase (Clapham and Neer,
supra). The signaling pathway mediated by cAMP is mitogenic in hormone-dependent endocrine tissues
such as adrenal cortex, thyroid, ovary, pituitary, and testes. Cancers in these tissues have been related
to a mutationally activated form of a Gαs known as the gsp (Gs protein) oncogene (Dhanasekaran,
supra). Another effector is phosducin, a retinal phosphoprotein, which forms a specific complex with
retinal G-protein β and γ subunits and modulates the ability of the β-γ dimer to interact with retinal α
subunits (Clapham and Neer, supra). Additional G-protein effectors include RIN1 (Ras
interaction/interference), which acts as an effector of H-Ras and interferes with the Ras signal
transduction pathway; Rabin3, which associates with the Ras-like GTPase Rab3A; and Rhotekin, a
protein that binds with, and inhibits, Rho GTPase activity (Han, L. and J. Colicelli (1995) Mol. Cell.
Biol. 15:1318-1323; Brondyk, W.H. et al. (1995) Mol. Cell. Biol. 15:1137-1143; and Reid, T. et al.
(1996) J. Biol. Chem. 271:13556-13560).

The low molecular weight GTP-binding proteins regulate cell growth, cell cycle control, protein
secretion, and intracellular vesicle interaction. These GTP-binding proteins respond to extracellular
signals from receptors and activating proteins by transducing mitogenic signals (Tavitian, A. (1995) C.
R. Seances Soc. Biol. Fil. 189:7-12). Low molecular weight GTP-binding proteins consist of single
polypeptides of 21-30 kD which, like the α subunit of heterotrimeric GTP-binding proteins, are able to
bind to and hydrolyze GTP, thus cycling from an inactive to an active state. The intrinsic rate of GTP
hydrolysis of these GTP-binding proteins is typically very slow, but it can be stimulated by several
orders of magnitude by GTPase-activating proteins (GAPs), such as β2-chimaerin (Geyer, M. and
Wittinghofer, A. (1997) Curr. Opin. Struct. Biol. 7:786-792; Caloca, M.J. et al. (1997) J. Biol. Chem.
272:26488-26496).

Low molecular weight GTP-binding proteins play critical roles in cellular protein trafficking
events, such as the translocation of proteins and soluble complexes from the cytosol to the membrane
through an exchange of GDP for GTP (Ktistakis, N.T. (1998) BioEssays 20:495-504). In vesicle
transport, the interaction between vesicle- and target-specific identifiers (v-SNAREs and t-SNAREs)
docks the vesicle to the acceptor membrane. The budding process is regulated by GTPases such as the
closely related ADP ribosylation factors (ARFs) and SAR proteins, while GTPases such as Rab allow
assembly of SNARE complexes and may play a role in removal of defective complexes (Rothman, J.E.
and F.T. Wieland (1996) Science 272:227-234). The rab proteins control the translocation of vesicles
to and from membranes for protein localization, protein processing, and secretion. The rho
GTP-binding proteins control signal transduction pathways that link growth factor receptors to actin
polymerization which is necessary for normal cellular growth and division. The ran GTP-binding
proteins are located in the nucleus of cells and have a key role in nuclear protein import, the control of
DNA synthesis, and cell-cycle progression (Hall, A. (1990) Science 249:635-640; Scheffzek, K. et al.
(1995) Nature 374:378-381).

The Ras proteins Ras1, Ras2 and Gsα stimulate adenylyl cyclase (Kaziro, supra) which affects
a broad array of cellular processes including determination of whether cells continue to grow or
become terminally differentiated. Stimulation of cell surface receptors activates Ras which, in turn,
activates cytoplasmic kinases. These kinases translocate to the nucleus and activate key transcription
factors that control gene expression and protein synthesis (Barbacid, M. (1987) Annu. Rev. Biochem.
56:779-827; Treisman, R. (1994) Curr. Opin. Genet. Dev. 4:96-101). Mutant Ras-family proteins
which bind but cannot hydrolyze GTP are permanently activated and are thus rendered oncogenic
(Drivas, G.T. et al. (1990) Mol. Cell. Biol. 10:1793-1798).

Ras-like proteins have also been implicated in tumor suppression. For example, NOEY2, a 
novel gene encoding a Ras-like protein, is expressed in normal ovarian and breast epithelial cells. 
However, NOEY2 expression is reduced or abrogated in ovarian and breast carcinomas, suggesting a 
role for the NOEY2 gene product in tumor suppression (Yu, Y. et al. (1999) Proc. Natl. Acad. Sci.
USA 96:214-219).

Irregularities in GTP-binding protein signaling cascades may result in abnormal activation of 
leukocytes and lymphocytes, leading to the tissue damage and destruction seen in many inflammatory 
and autoimmune diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic anemia, lupus 
erythematosus, and thyroiditis. Abnormal cell proliferation, including cyclic AMP-mediated
stimulation of brain, thyroid, adrenal, and gonadal tissue proliferation, is regulated by G proteins.
Mutations in Gα subunits have been found in growth-hormone-secreting pituitary somatotroph
tumors, hyperfunctioning thyroid adenomas, and ovarian and adrenal neoplasms (Meij, J.T.A. (1996)
Mol. Cell. Biochem. 157:31-38; Aussel, supra).

The discovery of new GTP-binding associated proteins and the polynucleotides encoding them
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention,
and treatment of immune system, reproductive, nervous system, and cell signaling disorders, and cell 
proliferative disorders including cancer. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, GTP-binding associated proteins, referred to
collectively as "GBAP" and individually as "GBAP-1," "GBAP-2," "GBAP-3," "GBAP-4,"
"GBAP-5," "GBAP-6," "GBAP-7," "GBAP-8," "GBAP-9," "GBAP-10," "GBAP-11," "GBAP-12,"
"GBAP-13," "GBAP-14," "GBAP-15," "GBAP-16," "GBAP-17," "GBAP-18," "GBAP-19," "GBAP-20,"
"GBAP-21," "GBAP-22," "GBAP-23," "GBAP-24," "GBAP-25," "GBAP-26," "GBAP-27,"
"GBAP-28," "GBAP-29," "GBAP-30," "GBAP-31," "GBAP-32," "GBAP-33," "GBAP-34,"
"GBAP-35," "GBAP-36," "GBAP-37," "GBAP-38," "GBAP-39," "GBAP-40," "GBAP-41,"
"GBAP-42," "GBAP-43," "GBAP-44," "GBAP-45," "GBAP-46," "GBAP-47," "GBAP-48,"
"GBAP-49," "GBAP-50," "GBAP-51," "GBAP-52," "GBAP-53," "GBAP-54," "GBAP-55,"
"GBAP-56," "GBAP-57," "GBAP-58," "GBAP-59," "GBAP-60," "GBAP-61," "GBAP-62,"
"GBAP-63," "GBAP-64," "GBAP-65," and "GBAP-66." In one aspect, the invention provides an
isolated polypeptide comprising an amino acid sequence selected from the group consisting of a) an
amino acid sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally occurring
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the
group consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino acid sequence
selected from the group consisting of SEQ ID NO:1-66, and d) an immunogenic fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-66. In one alternative, the invention
provides an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:1-66.
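
As an illustrative sketch only, the "at least 90% sequence identity" criterion recited above might be checked for two sequences that have already been aligned. The function name, the use of "-" as the gap character, and the choice to count identical residues over all aligned columns are assumptions of this sketch. Alignment programs differ in how they define the denominator, and nothing here is asserted to be the method used to analyze GBAP.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A residue paired with a gap never matches, so it counts against identity.
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


# Hypothetical ten-residue peptides differing at the last position.
identity = percent_identity("MKTAYIAKQR", "MKTAYIAKQK")
meets_threshold = identity >= 90.0
```

Counting gap-containing columns in the denominator makes this a conservative measure; a different convention would give a different percentage for the same alignment.
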

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the
group consisting of SEQ ID NO:1-66, b) a naturally occurring amino acid sequence having at least
90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID
NO:1-66, c) a biologically active fragment of an amino acid sequence selected from the group consisting of
SEQ ID NO:1-66, and d) an immunogenic fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66. In one alternative, the polynucleotide encodes a polypeptide selected
from the group consisting of SEQ ID NO:1-66. In another alternative, the polynucleotide is selected
from the group consisting of SEQ ID NO:67-132.

Additionally, the invention provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid
sequence selected from the group consisting of a) an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-66, c)
a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-66, and d) an immunogenic fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66. In one alternative, the invention provides a cell transformed with the
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism
comprising the recombinant polynucleotide.

The invention also provides a method for producing a polypeptide comprising an amino acid
sequence selected from the group consisting of a) an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66, b) a naturally occurring amino acid sequence having at least 90%
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:1-66, c)
a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID
NO:1-66, and d) an immunogenic fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66. The method comprises a) culturing a cell under conditions suitable for
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b)
recovering the polypeptide so expressed.

Additionally, the invention provides an isolated antibody which specifically binds to a
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid
sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally occurring amino acid
sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-66, and d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-66.

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:67-132, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous
nucleotides.
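
By way of a hedged example, the "complementary" and "RNA equivalent" forms recited above can be derived from a DNA sequence as shown in the following sketch. It assumes that the complementary sequence is reported as the reverse complement read 5' to 3', and that an RNA equivalent differs from the DNA sequence only by replacing thymine with uracil; the function names and the short example sequence are hypothetical and are not taken from SEQ ID NO:67-132.

```python
# Translation table for Watson-Crick pairing of unambiguous DNA bases.
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def rna_equivalent(dna: str) -> str:
    """Return the same base sequence with thymine replaced by uracil."""
    return dna.replace("T", "U").replace("t", "u")


def has_min_contiguous_length(sequence: str, minimum: int = 60) -> bool:
    """Check the 'at least N contiguous nucleotides' length condition."""
    return len(sequence) >= minimum


example = "ATGGCC"  # hypothetical fragment, for illustration only
complement_strand = reverse_complement(example)
rna_form = rna_equivalent(example)
long_enough = has_min_contiguous_length(example)
```

The translation table handles only the four unambiguous bases; ambiguity codes such as N pass through unchanged in this sketch.
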

Additionally, the invention provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:67-132, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions
whereby a hybridization complex is formed between said probe and said target polynucleotide or
fragments thereof, and b) detecting the presence or absence of said hybridization complex, and
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous
nucleotides.

The invention further provides a method for detecting a target polynucleotide in a sample, said
target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:67-132, b) a naturally occurring polynucleotide sequence having at least 70% sequence
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132, c) a
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e)
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or
fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the
amount thereof.

The invention further provides a composition comprising an effective amount of a polypeptide
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence
selected from the group consisting of SEQ ID NO:1-66, b) a naturally occurring amino acid sequence
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of
SEQ ID NO:1-66, c) a biologically active fragment of an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66, and d) an immunogenic fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-66, and a pharmaceutically acceptable excipient. In one
embodiment, the composition comprises an amino acid sequence selected from the group consisting of
SEQ ID NO:1-66. The invention additionally provides a method of treating a disease or condition
associated with decreased expression of functional GBAP, comprising administering to a patient in need
of such treatment the composition.

The invention also provides a method for screening a compound for effectiveness as an
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an
amino acid sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally occurring
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the
group consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino acid sequence
selected from the group consisting of SEQ ID NO:1-66, and d) an immunogenic fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-66. The method comprises a)
exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the
sample. In one alternative, the invention provides a composition comprising an agonist compound
identified by the method and a pharmaceutically acceptable excipient. In another alternative, the
invention provides a method of treating a disease or condition associated with decreased expression of
functional GBAP, comprising administering to a patient in need of such treatment the composition.

Additionally, the invention provides a method for screening a compound for effectiveness as
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting
of a) an amino acid sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-66, and d) an immunogenic fragment
of an amino acid sequence selected from the group consisting of SEQ ID NO:1-66. The method
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting
antagonist activity in the sample. In one alternative, the invention provides a composition comprising
an antagonist compound identified by the method and a pharmaceutically acceptable excipient. In
another alternative, the invention provides a method of treating a disease or condition associated with
overexpression of functional GBAP, comprising administering to a patient in need of such treatment
the composition.

The invention further provides a method of screening for a compound that specifically binds
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino
acid sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally occurring amino
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino acid sequence selected
from the group consisting of SEQ ID NO:1-66, and d) an immunogenic fragment of an amino acid
sequence selected from the group consisting of SEQ ID NO:1-66. The method comprises a) combining
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding
of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the
polypeptide.

The invention further provides a method of screening for a compound that modulates the
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a)
an amino acid sequence selected from the group consisting of SEQ ID NO:1-66, b) a naturally
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-66, c) a biologically active fragment of an amino
acid sequence selected from the group consisting of SEQ ID NO:1-66, and d) an immunogenic
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO:1-66. The
method comprises a) combining the polypeptide with at least one test compound under conditions
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the
test compound with the activity of the polypeptide in the absence of the test compound, wherein a
change in the activity of the polypeptide in the presence of the test compound is indicative of a
compound that modulates the activity of the polypeptide.
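
The comparison in step c) above can be sketched, purely as an illustration, as a fold-change test between activity measured with and without the test compound. The function names and the 1.5-fold cutoff are hypothetical choices for this sketch; the text above requires only "a change in the activity" and specifies no threshold or statistical test.

```python
def fold_change(activity_with: float, activity_without: float) -> float:
    """Ratio of activity in the presence vs. absence of the test compound."""
    if activity_without <= 0:
        raise ValueError("baseline activity must be positive")
    return activity_with / activity_without


def modulates_activity(
    activity_with: float,
    activity_without: float,
    min_fold: float = 1.5,  # hypothetical cutoff, not taken from the text
) -> bool:
    """Flag a compound when activity rises or falls by at least `min_fold`."""
    ratio = fold_change(activity_with, activity_without)
    return ratio >= min_fold or ratio <= 1.0 / min_fold


# Hypothetical readings in arbitrary units.
increased = modulates_activity(30.0, 10.0)   # threefold increase
unchanged = modulates_activity(11.0, 10.0)   # within the cutoff
decreased = modulates_activity(4.0, 10.0)    # 2.5-fold decrease
```

A symmetric cutoff treats increases and decreases alike, which matches the word "modulates"; in practice replicate measurements and a significance test would replace the single fixed ratio.
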

The invention further provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NO:67-132, the method comprising a)
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered
expression of the target polynucleotide.

The invention further provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound;
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID
NO:67-132, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity
to a polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132, iii) a
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii),
and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific
hybridization complex is formed between said probe and a target polynucleotide in the biological
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132,
ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a
polynucleotide sequence selected from the group consisting of SEQ ID NO:67-132, iii) a
polynucleotide sequence complementary to i), iv) a polynucleotide sequence
complementary to ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide
comprises a fragment of the above polynucleotide sequence; c) quantifying the amount of
hybridization complex; and d) comparing the amount of hybridization complex in the treated
biological sample with the amount of hybridization complex in an untreated biological sample,
wherein a difference in the amount of hybridization complex in the treated biological sample is
indicative of toxicity of the test compound.

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs),
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble
full-length sequences encoding GBAP.

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of GBAP. 

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression
patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or
conditions associated with these tissues; and the vector into which each cDNA was cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones
encoding GBAP were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze polynucleotides and
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters.

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is
not entitled to antedate such disclosure by virtue of prior invention.

DEFINITIONS

"GBAP" refers to the amino acid sequences of substantially purified GBAP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
GBAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of GBAP either by directly interacting with 
GBAP or by acting on components of the biological pathway in which GBAP participates. 

An "allelic variant" is an alternative form of the gene encoding GBAP. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times in
a given sequence.

"Altered" nucleic acid sequences encoding GB AP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as GBAP or a 
polypeptide with at least one functional characteristic of GBAP. Included within this definition are 

10 polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding GBAP, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding GBAP. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent GBAP. Deliberate 

15 amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of GBAP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 

20 include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence
to the complete native amino acid sequence associated with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known
in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
GBAP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of GBAP either by 
directly interacting with GBAP or by acting on components of the biological pathway in which GBAP 
participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof,
such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind GBAP polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies which
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to
elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition capable of base-pairing with the "sense"
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA;
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified
sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense
molecules may be produced by any method including chemical synthesis or transcription. Once
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring
nucleic acid sequence produced by the cell to form duplexes which block either transcription or
translation. The designation "negative" or "minus" can refer to the antisense strand, and the
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic GBAP, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific
antibodies.

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement,
3'-TCA-5'.
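
The pairing in the example above can be sketched in a few lines of Python. This is only an illustration: the dictionary and function names are hypothetical, and the sketch assumes an unambiguous DNA alphabet (A, C, G, T).

```python
# Watson-Crick pairing for DNA; names here are illustrative only.
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement_3_to_5(strand_5_to_3: str) -> str:
    """Return the complementary strand written 3'->5' (base-by-base pairing)."""
    return "".join(DNA_COMPLEMENT[base] for base in strand_5_to_3.upper())


def reverse_complement(strand_5_to_3: str) -> str:
    """Return the same complementary strand written in the 5'->3' direction."""
    return complement_3_to_5(strand_5_to_3)[::-1]


print(complement_3_to_5("AGT"))   # TCA, read 3'->5'
print(reverse_complement("AGT"))  # ACT, the same strand read 5'->3'
```

Both functions describe the same physical strand; only the direction in which it is written differs.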

A "composition comprising a given polynucleotide sequence" and a "composition comprising a
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding GBAP or fragments of GBAP may be
employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate;
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems,
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for
fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison WI) or Phrap
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to
produce the consensus sequence.

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the
protein is conserved and not significantly changed by such substitutions. The table below shows amino
acids which may be substituted for an original amino acid in a protein and which are regarded as
conservative amino acid substitutions.

Original Residue      Conservative Substitution

Ala                   Gly, Ser
Arg                   His, Lys
Asn                   Asp, Gln, His
Asp                   Asn, Glu
Cys                   Ala, Ser
Gln                   Asn, Glu, His
Glu                   Asp, Gln, His
Gly                   Ala
His                   Asn, Arg, Gln, Glu
Ile                   Leu, Val
Leu                   Ile, Val
Lys                   Arg, Gln, Glu
Met                   Leu, Ile
Phe                   His, Met, Leu, Trp, Tyr
Ser                   Cys, Thr
Thr                   Ser, Val
Trp                   Phe, Tyr
Tyr                   His, Phe, Trp
Val                   Ile, Leu, Thr

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation,
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the
side chain.
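
As an illustration, the substitution table above can be encoded as a lookup structure and queried. The sketch below is a hypothetical helper, not part of the disclosure; it reads the table directionally (original residue to allowed replacement), which is an assumption about how the table is meant to be applied.

```python
# Each key is an original residue; the value is the set of residues the table
# lists as conservative replacements for it (three-letter codes).
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as conservative for `original`."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())


print(is_conservative("Phe", "His"))  # True: His is listed for Phe
print(is_conservative("His", "Phe"))  # False: Phe is not listed for His
```

Because the table is not symmetric, a lookup of this kind depends on which residue is treated as the original.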

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains
at least one biological or immunological function of the natural molecule. A derivative polypeptide is
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or
immunological function of the polypeptide from which it was derived.

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide.

A "fragment" is a unique portion of GBAP or the polynucleotide encoding GBAP which is
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10,
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present
embodiments.
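
The length constraints in this definition (contiguous, identical in sequence, and at most one residue shorter than the parent) can be sketched as follows. The function names are hypothetical, and the sketch does not model the "unique" requirement, which depends on comparison against other sequences.

```python
def is_fragment(candidate: str, parent: str) -> bool:
    """True if `candidate` is a contiguous stretch of `parent` that is shorter than it."""
    return 0 < len(candidate) < len(parent) and candidate in parent


def fragments_of_length(parent: str, length: int) -> list[str]:
    """List every contiguous fragment of the given length (empty if length is out of range)."""
    if not 0 < length < len(parent):
        return []
    return [parent[i:i + length] for i in range(len(parent) - length + 1)]


print(is_fragment("GCT", "ATGGCTTAA"))        # True
print(is_fragment("ATGGCTTAA", "ATGGCTTAA"))  # False: not shorter than the parent
print(fragments_of_length("ATGGCT", 5))       # ['ATGGC', 'TGGCT']
```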

A fragment of SEQ ID NO:67-132 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:67-132, for example, as distinct from any other sequence in the
genome from which the fragment was obtained. A fragment of SEQ ID NO:67-132 is useful, for
example, in hybridization and amplification technologies and in analogous methods that distinguish
SEQ ID NO:67-132 from related polynucleotide sequences. The precise length of a fragment of SEQ
ID NO:67-132 and the region of SEQ ID NO:67-132 to which the fragment corresponds are routinely
determinable by one of ordinary skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-66 is encoded by a fragment of SEQ ID NO:67-132. A
fragment of SEQ ID NO:1-66 comprises a region of unique amino acid sequence that specifically
identifies SEQ ID NO:1-66. For example, a fragment of SEQ ID NO:1-66 is useful as an
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-66.
The precise length of a fragment of SEQ ID NO:1-66 and the region of SEQ ID NO:1-66 to which the
fragment corresponds are routinely determinable by one of ordinary skill in the art based on the
intended purpose for the fragment.

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon
(e.g., methionine) followed by an open reading frame and a translation termination codon. A
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence.
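
A minimal check for this structure might look like the sketch below. It is an illustration under stated assumptions: the input is taken to be the coding region only (no untranslated flanks), written as DNA, and the standard genetic code's ATG start and TAA/TAG/TGA stop codons are used. The function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet


def is_full_length_cds(seq: str) -> bool:
    """True if seq is ATG + in-frame codons with no internal stop + a final stop codon."""
    seq = seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    has_start = codons[0] == "ATG"
    has_stop = codons[-1] in STOP_CODONS
    internal_stop = any(codon in STOP_CODONS for codon in codons[1:-1])
    return has_start and has_stop and not internal_stop


print(is_full_length_cds("ATGGCCTAA"))     # True
print(is_full_length_cds("ATGTAAGCCTAA"))  # False: stop codon inside the frame
print(is_full_length_cds("GCCGCCTAA"))     # False: no initiation codon
```

A sequence that also carries 5' or 3' untranslated regions would need the reading frame located first, which this sketch does not attempt.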

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to
the percentage of residue matches between at least two polynucleotide sequences aligned using a
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in
the sequences being compared in order to optimize alignment between two sequences, and therefore
achieve a more meaningful comparison of the two sequences.
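
Once two sequences have been aligned, the percentage of residue matches can be computed directly from the gapped strings. The sketch below is a simplified illustration, not the calculation performed by any particular alignment program: the function name is hypothetical, and the choice of denominator (every alignment column containing at least one residue) is an assumption, since programs differ on this point.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # a column of two gaps carries no information
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Six columns, four matches, one mismatch, one gap.
print(round(percent_identity("ACGT-A", "ACCTGA"), 2))  # 66.67
```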

Percent identity between polynucleotide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence
alignment program. This program is part of the LASERGENE software package, a suite of molecular
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G.
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191.
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows:
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between
aligned polynucleotide sequences.

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several
sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on
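
To give a rough sense of how the reward, penalty, and gap values listed above combine, the sketch below computes a raw score for a nucleotide alignment from its counts of matches, mismatches, and gaps. It is not BLAST itself: the function name is hypothetical, and the affine gap convention used here (each gap costs the opening penalty plus the extension penalty times its length) is an assumption made for illustration.

```python
def raw_alignment_score(matches: int, mismatches: int, gap_lengths: list[int],
                        reward: int = 1, penalty: int = -2,
                        gap_open: int = 5, gap_extend: int = 2) -> int:
    """Raw score: rewards for matches, penalties for mismatches, affine cost per gap."""
    # Assumed convention: a gap of length k costs gap_open + gap_extend * k.
    gap_cost = sum(gap_open + gap_extend * length for length in gap_lengths)
    return matches * reward + mismatches * penalty - gap_cost


# 20 matches, 2 mismatches, one gap of 3 positions: 20 - 4 - (5 + 6) = 5
print(raw_alignment_score(matches=20, mismatches=2, gap_lengths=[3]))
```

With these values a single mismatch cancels two matches, which is why short, imperfect alignments score poorly under the defaults.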

Percent identity may be measured over the length of an entire defined sequence, for example, as
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such
lengths are exemplary only, and it is understood that any fragment length supported by the sequences
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which
percentage identity may be measured.

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences
that all encode substantially the same protein.
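
The effect of degeneracy can be seen with a small worked example. The sketch below uses a partial codon table drawn from the standard genetic code and two made-up sequences (both hypothetical, chosen only for illustration) that differ at many nucleotide positions yet translate to the same peptide.

```python
# Partial standard genetic code: only the codons used in this example.
CODON_TABLE = {
    "ATG": "M",
    "GCT": "A", "GCG": "A",
    "CTT": "L", "TTG": "L",
    "TCT": "S", "AGC": "S",
    "TAA": "*", "TGA": "*",
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop codon)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


seq_1 = "ATGGCTCTTTCTTAA"
seq_2 = "ATGGCGTTGAGCTGA"

identical_positions = sum(a == b for a, b in zip(seq_1, seq_2))
print(translate(seq_1), translate(seq_2))            # MALS* MALS*
print(f"{identical_positions} of {len(seq_1)} nucleotides identical")
```

Only about half of the nucleotide positions agree, while the encoded amino acid sequence is unchanged.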

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to
the percentage of residue matches between at least two polypeptide sequences aligned using a
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment
methods take into account conservative amino acid substitutions. Such conservative substitutions,
explained in more detail above, generally preserve the charge and hydrophobicity at the site of
substitution, thus preserving the structure (and therefore function) of the polypeptide.

Percent identity between polypeptide sequences may be determined using the default parameters
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence alignment
program (described and referenced above). For pairwise alignments of polypeptide sequences using
CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap penalty=3, window=5, and
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity"
between aligned polypeptide sequence pairs.

Alternatively, the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12
(Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for
chromosome replication, segregation and maintenance.

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 

20 complementary strand through base pairing under defined hybridization conditions. Specific 

hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 

25 binding between pairs of nucleic acid strands that are not perfectly matched Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 

30 SDS, and about 100 pg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature
under which the wash step is carried out. Such wash temperatures are typically selected to be about
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the
target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and conditions
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 1989, Molecular
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically
see volume 2, chapter 9.
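
As an illustration of the relationship between Tm and wash temperature, the sketch below uses one widely quoted empirical approximation for long DNA:DNA hybrids. The equation and its coefficients are an assumption made here for illustration and are not taken from this document; the cited manual should be consulted for the appropriate form. The function names and example inputs are hypothetical.

```python
import math


def approximate_tm_celsius(gc_percent: float, length_bp: int, na_molar: float,
                           formamide_percent: float = 0.0) -> float:
    """Empirical Tm estimate (assumed formula) for long DNA:DNA hybrids."""
    return (81.5
            + 16.6 * math.log10(na_molar)   # monovalent cation concentration, mol/L
            + 0.41 * gc_percent             # percent G+C content
            - 0.63 * formamide_percent      # percent formamide (v/v)
            - 600.0 / length_bp)            # hybrid length in base pairs


def wash_temperature_range(tm_celsius: float) -> tuple[float, float]:
    """Wash window of about 5 to 20 degrees C below Tm, as described in the text."""
    return (tm_celsius - 20.0, tm_celsius - 5.0)


tm = approximate_tm_celsius(gc_percent=50.0, length_bp=500, na_molar=0.195)
low, high = wash_temperature_range(tm)
print(f"Tm ~ {tm:.1f} C; wash between {low:.1f} C and {high:.1f} C")
```

Lowering the salt concentration lowers the estimated Tm, which is consistent with low-salt washes being the more stringent ones.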

High stringency conditions for hybridization between polynucleotides of the present invention
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour.
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances,
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is
strongly indicative of a similar role for the nucleotides and their encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid
sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization
complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g.,
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells
or their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of GBAP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
GBAP which is useful in any of the antibody production methods disclosed herein or known in the art.

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of GBAP. For example, modulation
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological,
functional, or immunological properties of GBAP.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an GBAP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
20 cell type depending on the enzymatic milieu of GBAP. 

"Probe" refers to nucleic acid sequences encoding GBAP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for
example Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al., 1987, Current Protocols in Molecular
Biology, Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al., 1990, PCR
Protocols, A Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs
can be derived from a known sequence, for example, by using computer programs intended for that
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge
MA).

Oligonucleotides for use as primers are selected using software known in the art for such
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection
programs have incorporated additional features for expanded capabilities. For example, the PrimOU
primer selection program (available to the public from the Genome Center at University of Texas South
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection
program (available to the public from the Whitehead Institute/MIT Center for Genome Research,
Cambridge MA) allows the user to input a "mispriming library," in which sequences to avoid as primer
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for
microarrays. (The source code for the latter two primer selection programs may also be obtained from
their respective sources and modified to meet the user's specific needs.) The PrimeGen program
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences.
Hence, this program is useful for identification of both unique and conserved oligonucleotides and
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing
primers, microarray elements, or specific probes to identify fully or partially complementary
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to
those described above.

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence
that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence.
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal.

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions (UTRs).
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid,
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent,
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and
other moieties known in the art.

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose
instead of deoxyribose.
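By way of illustration only, the base substitution in this definition can be sketched in a few lines of Python. The function name `rna_equivalent` and the accepted alphabet (A, C, G, T, and N for an unknown base) are assumptions made for this example and are not part of the specification. The sugar backbone is a chemical property and is not represented in a text sequence.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence.

    The linear order of nucleotides is preserved; every thymine (T)
    is replaced with uracil (U).
    """
    dna = dna.upper()
    # Assumed alphabet for this sketch: the four DNA bases plus N (unknown).
    unexpected = set(dna) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(unexpected)}")
    return dna.replace("T", "U")


if __name__ == "__main__":
    print(rna_equivalent("ATGTTC"))
```

The length of the returned string always equals the length of the input, which reflects the requirement that the RNA equivalent have the same linear sequence of nucleotides.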

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic
acids encoding GBAP, or fragments thereof, or GBAP itself, may comprise a bodily fluid; an extract
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or
synthetic binding composition. The interaction is dependent upon the presence of a particular structure
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will
reduce the amount of labeled A that binds to the antibody.

The term "substantially purified" refers to nucleic acid or amino acid sequences that are
removed from their natural environment and are isolated or separated, and are at least 60% free,
preferably at least 75% free, and most preferably at least 90% free from other components with which
they are naturally associated.

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by
different amino acid residues or nucleotides, respectively.

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters,
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells,
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound.

A "transcript image" refers to the collective pattern of gene expression by a particular cell type
or tissue under given conditions at a given time.

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection,
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to
animals and plants, in which one or more of the cells of the organism contains heterologous nucleic
acid introduced by way of human intervention, such as by transgenic techniques well known in the
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The
transgenic organisms contemplated in accordance with the present invention include bacteria,
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be
introduced into the host by methods known in the art, for example infection, transfection,
transformation or transconjugation. Techniques for transferring the DNA of the present invention
into such organisms are widely known and provided in references such as Sambrook et al. (1989),
supra.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an "allelic"
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may
possess additional functional domains or lack domains that are present in the reference molecule.
Species variants are polynucleotide sequences that vary from one species to another. The resulting
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be
indicative of, for example, a certain population, a disease state, or a propensity for a disease state.

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999)
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence
identity over a certain defined length of one of the polypeptides.
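The percent identity thresholds in the two "variant" definitions above can be illustrated with a simplified calculation. The following Python sketch is not the BLAST 2 Sequences tool and does not perform an alignment; it assumes that the two sequences have already been aligned to equal length, with "-" marking gaps, and it counts identical columns over the aligned length. The function names `percent_identity` and `meets_variant_threshold` are hypothetical and are used here only for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding identical residues.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Columns in which both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    # A column matches only when both positions hold the same residue (not a gap).
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_variant_threshold(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True if identity over the aligned length is at least `threshold` percent."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    print(percent_identity("ACGT", "ACGA"))
    print(meets_variant_threshold("ACGT", "TTTT"))
```

The default threshold of 40% corresponds to the minimum stated in the definitions; the higher values listed there (for example 70% or 95%) may be passed as the `threshold` argument.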

THE INVENTION

The invention is based on the discovery of new human GTP-binding associated proteins 
(GBAP), the polynucleotides encoding GBAP, and the use of these compositions for the diagnosis,
treatment, or prevention of immune system, reproductive, nervous system, and cell signaling disorders, 
and cell proliferative disorders including cancer. 

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding
GBAP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which
nucleic acids encoding each GBAP were identified, and column 4 shows the cDNA libraries from which
these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries.
Clones for which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some
cases, GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank
cDNA sequences, where indicated, in column 5 were used to assemble the consensus nucleotide
sequence of each GBAP and are useful as fragments in hybridization technologies.

The columns of Table 2 show various properties of each of the polypeptides of the invention:
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods
and in some cases, searchable databases to which the analytical methods were applied. The methods of
column 7 were used to characterize each polypeptide through sequence homology and protein motifs.

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions
associated with nucleotide sequences encoding GBAP. The first column of Table 3 lists the nucleotide
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments are
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:67-132 and
to distinguish between SEQ ID NO:67-132 and related polynucleotide sequences. The polypeptides
encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue
categories which express GBAP as a fraction of total tissues expressing GBAP. Column 4 lists
diseases, disorders, or conditions associated with those tissues expressing GBAP as a fraction of total
tissues expressing GBAP. Column 5 lists the vectors used to subclone each cDNA library. Of
particular note is the expression of SEQ ID NO:84 in lung tissues, and the tissue-specific expression of
SEQ ID NO:132. Over 90% of tissues expressing SEQ ID NO:132 are derived from the nervous
system, particularly the brain.

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries
from which cDNA clones encoding GBAP were isolated. Column 1 references the nucleotide SEQ ID
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2.

SEQ ID NO:70 maps to chromosome 7 within the interval from 111.6 to 123.4 centiMorgans.
This interval contains a gene that is down regulated in adenoma. SEQ ID NO:74 maps to chromosome
11 within the interval from 104.8 to 123.5 centiMorgans. This interval contains a gene associated with
the cerebellar degenerative disorder, ataxia telangiectasia. SEQ ID NO:75 maps to chromosome 17
within the interval from 62.9 to 65.0 centiMorgans. SEQ ID NO:77 maps to chromosome 3 within the
interval from 12.9 to 16.5 centiMorgans. SEQ ID NO:80 maps to chromosome 9 within the interval
from 42.0 to 57.3 centiMorgans. SEQ ID NO:86 maps to chromosome 1 within the interval from 159.6
to 164.1 centiMorgans. SEQ ID NO:87 maps to chromosome 11 within the interval from 147.2 to
151.6. SEQ ID NO:90 maps to chromosome 1 within the interval from 219.2 to 223.0 centiMorgans.
This interval contains a gene encoding a RAB interacting protein. SEQ ID NO:92 and SEQ ID
NO:106 both map to chromosome 1 within the interval from 48.8 to 81.6 centiMorgans. This interval
also contains genes associated with familial hypercholesterolemia, glucose transport defect, infantile
hypophosphatasia, infantile neuronal ceroid lipofuscinosis, Kostmann disease, multiple epiphyseal
dysplasia, porphyria cutanea tarda, and T-cell acute lymphocytic leukemia 1. SEQ ID NO:93 maps to
chromosome 12 within the interval from 76.5 to 87.6 centiMorgans. This interval also contains genes
associated with mucopolysaccharidosis type IIID, pseudovitamin D deficiency rickets, and renal
amyloidosis. SEQ ID NO:94 and SEQ ID NO:109 both map to chromosome 1 within the interval from
143.1 to 146.6 centiMorgans, to chromosome 14 within the interval from 46.8 to 50.9 centiMorgans, to
chromosome 16 within the interval from 88.1 to 90.2 centiMorgans, and to chromosome 19 within the
interval from 58.7 to 97.5 centiMorgans. The interval on chromosome 14 from 46.8 to 50.9
centiMorgans also contains a gene associated with dopa-responsive dystonia. The interval on
chromosome 19 from 58.7 to 97.5 centiMorgans also contains genes associated with colorectal cancer,
DNA ligase I deficiency, glutaricaciduria IIB, myotonic dystrophy, renal amyloidosis, T-cell acute
lymphoblastic leukemia, and xeroderma pigmentosum D. SEQ ID NO:97 maps to chromosome 2
within the interval from 236.2 to 269.5 centiMorgans. This interval also contains genes associated with
Crigler-Najjar syndrome, familial hypercholesterolemia, Oguchi disease, and primary hyperoxaluria.
SEQ ID NO:101 maps to chromosome 2 within the interval from 225.6 to 233.1 centiMorgans, to
chromosome 6 within the interval from 132.7 to 144.4 centiMorgans, and to chromosome 11 within the
interval from 117.9 to 120.8 centiMorgans. The interval on chromosome 2 from 225.6 to 233.1
centiMorgans also contains a gene associated with Waardenburg syndrome 1. The interval on
chromosome 6 from 132.7 to 144.4 centiMorgans also contains genes associated with familial
disseminated atypical mycobacterial infection and rhizomelic chondrodysplasia punctata. The interval
on chromosome 11 from 117.9 to 120.8 centiMorgans also contains a gene associated with acute
intermittent porphyria. SEQ ID NO:111 maps to chromosome 19 within the interval from 35.5 to 49.4
centiMorgans, to chromosome 1 within the interval from the p-terminus to 16.4 centiMorgans, and to
chromosome 11 within the interval from 147.2 centiMorgans to the q-terminus. SEQ ID NO:112 maps
to chromosome 19 within the interval from 41.7 to 49.4 centiMorgans. SEQ ID NO:113 maps to
chromosome 9 within the interval from 136.2 to 163.0 centiMorgans. SEQ ID NO:115 maps to
chromosome 14 within the interval from 95.5 to 103.7 centiMorgans and to the X chromosome (23)
within the interval from the p-terminus to 55.5 centiMorgans. SEQ ID NO:117 maps to chromosome
13 at 46.9 centiMorgans. SEQ ID NO:118 maps to chromosome 1 within the interval from 16.4 to
22.9 centiMorgans. SEQ ID NO:121 maps to chromosome 12 within the interval from 116.6 to 118.9
centiMorgans. SEQ ID NO:128 maps to chromosome 1 within the interval from the p-terminus to 16.4
centiMorgans.

The invention also encompasses GBAP variants. A preferred GBAP variant is one which has
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the GBAP amino acid sequence, and which contains at least one functional or structural 
characteristic of GBAP. 

The invention also encompasses polynucleotides which encode GBAP. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from
the group consisting of SEQ ID NO:67-132, which encodes GBAP. The polynucleotide sequences of
SEQ ID NO:67-132, as presented in the Sequence Listing, embrace the equivalent RNA sequences,
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone
is composed of ribose instead of deoxyribose.

The invention also encompasses a variant of a polynucleotide sequence encoding GBAP. In
particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence
encoding GBAP. A particular aspect of the invention encompasses a variant of a polynucleotide
sequence comprising a sequence selected from the group consisting of SEQ ID NO:67-132 which has at
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:67-132.
Any one of the polynucleotide variants described above can encode an amino acid sequence which
contains at least one functional or structural characteristic of GBAP.

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic
code, a multitude of polynucleotide sequences encoding GBAP, some bearing minimal similarity to the
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the
invention contemplates each and every possible variation of polynucleotide sequence that could be made
by selecting combinations based on possible codon choices. These combinations are made in
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally
occurring GBAP, and all such variations are to be considered as being specifically disclosed.
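The size of the "multitude" referred to above can be estimated by multiplying, residue by residue, the number of codons that the standard genetic code assigns to each amino acid. The following Python sketch illustrates that arithmetic under the assumption that the peptide is given in one-letter code; the function name `count_coding_sequences` is hypothetical, and stop codons are not included in the count.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Number of synonymous codons per amino acid, e.g. 6 for leucine, 1 for methionine.
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode `peptide` (one-letter code)."""
    peptide = peptide.upper()
    unknown = {aa for aa in peptide if aa == "*" or aa not in CODONS_PER_RESIDUE}
    if unknown:
        raise ValueError(f"not a standard amino acid: {sorted(unknown)}")
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


if __name__ == "__main__":
    print(count_coding_sequences("MLS"))
```

Because the count is a product over all residues, it grows exponentially with peptide length, which is why the possible coding sequences for a full-length protein cannot practically be enumerated one by one.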

Although nucleotide sequences which encode GBAP and its variants are generally capable of
hybridizing to the nucleotide sequence of the naturally occurring GBAP under appropriately selected
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding GBAP or its
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons
are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding
GBAP and its derivatives without altering the encoded amino acid sequences include the production of
RNA transcripts having more desirable properties, such as a greater half-life, than transcripts produced
from the naturally occurring sequence.
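Selection of codons according to host usage frequency may be sketched as follows. This Python example is a minimal illustration only: the names `preferred_codons` and `back_translate` are hypothetical, and the frequencies in `EXAMPLE_USAGE` are placeholder values invented for the example, not measured codon usage for any organism. In practice a complete usage table for the chosen host would be supplied.

```python
from typing import Dict

# Placeholder usage table: amino acid -> {codon: relative frequency}.
# These numbers are illustrative assumptions, not data for a real host.
EXAMPLE_USAGE: Dict[str, Dict[str, float]] = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.50, "TTA": 0.13, "TTG": 0.13, "CTT": 0.10, "CTC": 0.10, "CTA": 0.04},
}


def preferred_codons(usage: Dict[str, Dict[str, float]]) -> Dict[str, str]:
    """For each amino acid, pick the codon with the highest frequency."""
    return {aa: max(codons, key=codons.get) for aa, codons in usage.items()}


def back_translate(peptide: str, usage: Dict[str, Dict[str, float]]) -> str:
    """Encode `peptide` using the most frequent codon for each residue."""
    best = preferred_codons(usage)
    missing = {aa for aa in peptide.upper() if aa not in best}
    if missing:
        raise ValueError(f"no codon usage data for: {sorted(missing)}")
    return "".join(best[aa] for aa in peptide.upper())


if __name__ == "__main__":
    print(back_translate("MKL", EXAMPLE_USAGE))
```

Always choosing the single most frequent codon is the simplest possible strategy and is used here only to keep the sketch short; the encoded amino acid sequence is unchanged regardless of which synonymous codon is selected.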

The invention also encompasses production of DNA sequences which encode GBAP and 
GBAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce
mutations into a sequence encoding GBAP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID
NO:67-132 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol.
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in
"Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of the
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE Biosystems,
Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV),
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (PE
Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system
(PE Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale
CA), or other systems known in the art. The resulting sequences are analyzed using a variety of
algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in
Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular
Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.)

The nucleic acid sequences encoding GBAP may be extended utilizing a partial nucleotide
sequence and employing various PCR-based methods known in the art to detect upstream sequences,
such as promoters and regulatory elements. For example, one method which may be employed,
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.)
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown
sequence from a circularized template. The template is derived from restriction fragments comprising a
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al.
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and
ligations may be used to insert an engineered double-stranded sequence into a region of unknown
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060.)
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in
finding intron/exon junctions. For all PCR-based methods, primers may be designed using
commercially available software, such as OLIGO 4.06 Primer Analysis software (National Biosciences,
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to
72°C.
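Two of the primer criteria stated above, length and GC content, are simple enough to check programmatically. The following Python sketch is offered as an illustration and is not a substitute for the primer analysis software named above; the function names `gc_fraction` and `meets_primer_guidelines` are hypothetical, and the annealing temperature criterion is deliberately not evaluated because it depends on the thermodynamic model and reaction conditions used.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of bases in `seq` that are G or C."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


def meets_primer_guidelines(
    primer: str, min_len: int = 22, max_len: int = 30, min_gc: float = 0.50
) -> bool:
    """Check length (about 22 to 30 nt) and GC content (about 50% or more).

    Annealing temperature is not checked in this sketch.
    """
    primer = primer.upper()
    invalid = set(primer) - set("ACGT")
    if invalid:
        raise ValueError(f"unexpected characters in primer: {sorted(invalid)}")
    # The length test runs first, so gc_fraction is never called on an empty primer.
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    print(meets_primer_guidelines("GCGCATATGCGCATATGCGCATAT"))
```

The default bounds mirror the values given in the text and may be overridden through the keyword arguments when different design constraints apply.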

When screening for full-length cDNAs, it is preferable to use libraries that have been
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5'
non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze the
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary
sequencing may employ flowable polymers for electrophoretic separation, four different
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire
process from loading of samples to computer analysis and electronic data display may be computer
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments
which may be present in limited amounts in a particular sample.

In another embodiment of the invention, polynucleotide sequences or fragments thereof which
encode GBAP may be cloned in recombinant DNA molecules that direct expression of GBAP, or
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of
the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be produced and used to express GBAP.

The nucleotide sequences of the present invention can be engineered using methods generally
known in the art in order to alter GBAP-encoding sequences for a variety of purposes including, but not
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic
oligonucleotides may be used to engineer the nucleotide sequences. For example,
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites,
alter glycosylation patterns, change codon preference, produce splice variants, and so forth.

The nucleotides of the present invention may be subjected to DNA shuffling techniques such
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or
improve the biological properties of GBAP, such as its biological or enzymatic activity or its ability
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene
variants is produced using PCR-mediated recombination of gene fragments. The library is then
subjected to selection or screening procedures that identify those gene variants with the desired
properties. These preferred variants may then be pooled and further subjected to recursive rounds of
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial"
breeding and rapid molecular evolution. For example, fragments of a single gene containing random
point mutations may be recombined, screened, and then reshuffled until the desired properties are
optimized. Alternatively, fragments of a given gene may be recombined with fragments of
homologous genes in the same gene family, either from the same or different species, thereby
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable
manner.

In another embodiment, sequences encoding GBAP may be synthesized, in whole or in part,
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively,
GBAP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g.,
Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New York NY, pp.
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved
using the ABI 431A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of
GBAP, or any part thereof, may be altered during direct synthesis and/or combined with sequences
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a
sequence of a naturally occurring polypeptide.

The peptide may be substantially purified by preparative high performance liquid
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.)
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing.
(See, e.g., Creighton, supra, pp. 28-53.)

In order to express a biologically active GBAP, the nucleotide sequences encoding GBAP or
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains
the necessary elements for transcriptional and translational control of the inserted coding sequence in a
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences
encoding GBAP. Such elements may vary in their strength and specificity. Specific initiation signals
may also be used to achieve more efficient translation of sequences encoding GBAP. Such signals
include the ATG initiation codon and adjacent sequences, e.g., the Kozak sequence. In cases where
sequences encoding GBAP and its initiation codon and upstream regulatory sequences are inserted into
the appropriate expression vector, no additional transcriptional or translational control signals may be
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous
translational control signals including an in-frame ATG initiation codon should be provided by the
vector. Exogenous translational elements and initiation codons may be of various origins, both natural
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ.
20:125-162.)

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing sequences encoding GBAP and appropriate transcriptional and translational control
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995)
Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and 16.)

A variety of expression vector/host systems may be utilized to contain and express sequences
encoding GBAP. These include, but are not limited to, microorganisms such as bacteria transformed
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus);
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or
animal cell systems. (See, e.g., Sambrook, supra; Ausubel, supra; Van Heeke, G. and S.M. Schuster
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544;
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl.
Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu,
N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al.
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp.
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington,
J.J. et al. (1997) Nat. Genet. 15:345-355.) Expression vectors derived from retroviruses,
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci.
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al.
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.)
The invention is not limited by the host cell employed.

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotide sequences encoding GBAP. For example, routine cloning,
subcloning, and propagation of polynucleotide sequences encoding GBAP can be achieved using a
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid
(Life Technologies). Ligation of sequences encoding GBAP into the vector's multiple cloning site
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested
deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem.
264:5503-5509.) When large quantities of GBAP are needed, e.g., for the production of antibodies,
vectors which direct high level expression of GBAP may be used. For example, vectors containing the
strong, inducible T5 or T7 bacteriophage promoter may be used.

Yeast expression systems may be used for production of GBAP. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration
of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra;
Bitter, supra; and Scorer, supra.)

Plant systems may also be used for expression of GBAP. Transcription of sequences encoding
GBAP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311).
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be
used. (See, e.g., Coruzzi, supra; Broglie, supra; and Winter, supra.) These constructs can be
introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, e.g.,
The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp.
191-196.)

In mammalian cells, a number of viral-based expression systems may be utilized. In cases
where an adenovirus is used as an expression vector, sequences encoding GBAP may be ligated into an
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses GBAP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or
EBV-based vectors may also be used for high-level protein expression.

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers,
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.)

For long term production of recombinant proteins in mammalian systems, stable expression of
GBAP in cell lines is preferred. For example, sequences encoding GBAP can be transformed into cell
lines using expression vectors which may contain viral origins of replication and/or endogenous
expression elements and a selectable marker gene on the same or on a separate vector. Following the
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before
being switched to selective media. The purpose of the selectable marker is to confer resistance to a
selective agent, and its presence allows growth and recovery of cells which successfully express the
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue
culture techniques appropriate to the cell type.

Any number of selection systems may be used to recover transformed cell lines. These include,
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase
genes, for use in tk- and apr- cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 11:223-232;
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to
chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980)
Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.)
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A.
(1995) Methods Mol. Biol. 55:121-131.)

Although the presence/absence of marker gene expression suggests that the gene of interest is
also present, the presence and expression of the gene may need to be confirmed. For example, if the
sequence encoding GBAP is inserted within a marker gene sequence, transformed cells containing
sequences encoding GBAP can be identified by the absence of marker gene function. Alternatively, a
marker gene can be placed in tandem with a sequence encoding GBAP under the control of a single
promoter. Expression of the marker gene in response to induction or selection usually indicates
expression of the tandem gene as well.

In general, host cells that contain the nucleic acid sequence encoding GBAP and that express
GBAP may be identified by a variety of procedures known to those of skill in the art. These procedures
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and
protein bioassay or immunoassay techniques which include membrane, solution, or chip based
technologies for the detection and/or quantification of nucleic acid or protein sequences.

Immunological methods for detecting and measuring the expression of GBAP using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal
antibodies reactive to two non-interfering epitopes on GBAP is preferred, but a competitive binding
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et
al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect. IV; Coligan, J.E.
et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and Wiley-Interscience, New
York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa NJ.)

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization
or PCR probes for detecting sequences related to polynucleotides encoding GBAP include oligolabeling,
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, the
sequences encoding GBAP, or any fragments thereof, may be cloned into a vector for the production of
an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6
and labeled nucleotides. These procedures may be conducted using a variety of commercially available
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates,
cofactors, inhibitors, magnetic particles, and the like.

Host cells transformed with nucleotide sequences encoding GBAP may be cultured under
conditions suitable for the expression and recovery of the protein from cell culture. The protein
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
polynucleotides which encode GBAP may be designed to contain signal sequences which direct
secretion of GBAP through a prokaryotic or eukaryotic cell membrane.

In addition, a host cell strain may be chosen for its ability to modulate expression of the
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation,
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells
which have specific cellular machinery and characteristic mechanisms for post-translational activities
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing
of the foreign protein.

In another embodiment of the invention, natural, modified, or recombinant nucleic acid
sequences encoding GBAP may be ligated to a heterologous sequence resulting in translation of a fusion
protein in any of the aforementioned host systems. For example, a chimeric GBAP protein containing a
heterologous moiety that can be recognized by a commercially available antibody may facilitate the
screening of peptide libraries for inhibitors of GBAP activity. Heterologous protein and peptide
moieties may also facilitate purification of fusion proteins using commercially available affinity
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins,
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site
located between the GBAP encoding sequence and the heterologous protein sequence, so that GBAP
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein
expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of commercially
available kits may also be used to facilitate expression and purification of fusion proteins.
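
The pairing of fusion moieties with purification matrices given above can be summarized as a lookup table. The following is a minimal Python sketch for illustration only; the names PURIFICATION_MATRIX, EPITOPE_TAGS, and purification_strategy are hypothetical and are not part of the disclosure.

```python
# Affinity moieties and the matrix each binds, as paired in the text above.
PURIFICATION_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "immobilized maltose",
    "Trx": "immobilized phenylarsine oxide",
    "CBP": "immobilized calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified by immunoaffinity with tag-specific antibodies.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_strategy(tag: str) -> str:
    """Describe how a fusion protein carrying `tag` would be purified."""
    if tag in PURIFICATION_MATRIX:
        return f"affinity purification on {PURIFICATION_MATRIX[tag]}"
    if tag in EPITOPE_TAGS:
        return f"immunoaffinity purification with an anti-{tag} antibody"
    raise ValueError(f"unknown tag: {tag}")
```

For example, purification_strategy("6-His") reports purification on a metal-chelate resin, while an unlisted moiety raises an error rather than guessing a matrix.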

In a further embodiment of the invention, synthesis of radiolabeled GBAP may be achieved in
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for
example, ³⁵S-methionine.

GBAP of the present invention or fragments thereof may be used to screen for compounds
that specifically bind to GBAP. At least one and up to a plurality of test compounds may be screened
for specific binding to GBAP. Examples of test compounds include antibodies, oligonucleotides,
proteins (e.g., receptors), or small molecules.

In one embodiment, the compound thus identified is closely related to the natural ligand of
GBAP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2):
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which GBAP
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the
compound can be rationally designed using known techniques. In one embodiment, screening for
these compounds involves producing appropriate cells which express GBAP, either as a secreted
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E.
coli. Cells expressing GBAP or cell membrane fractions which contain GBAP are then contacted
with a test compound and binding, stimulation, or inhibition of activity of either GBAP or the
compound is analyzed.

An assay may simply test binding of a test compound to the polypeptide, wherein binding is
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example,
the assay may comprise the steps of combining at least one test compound with GBAP, either in
solution or affixed to a solid support, and detecting the binding of GBAP to the compound.
Alternatively, the assay may detect or measure binding of a test compound in the presence of a
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a
solid support.
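
One common way to read out an assay run in the presence of a labeled competitor is to express the loss of label signal as a percentage. The sketch below illustrates that arithmetic in Python under stated assumptions: the function name percent_displacement, its parameters, and the background-subtraction step are hypothetical and are not specified in this description.

```python
def percent_displacement(total_signal: float,
                         signal_with_compound: float,
                         nonspecific_signal: float = 0.0) -> float:
    """Percentage of specifically bound labeled competitor displaced by a test compound.

    total_signal: label signal with GBAP and labeled competitor only.
    signal_with_compound: label signal after adding the test compound.
    nonspecific_signal: assumed background signal not due to GBAP binding.
    """
    specific_total = total_signal - nonspecific_signal
    if specific_total <= 0:
        raise ValueError("total signal must exceed nonspecific signal")
    specific_bound = signal_with_compound - nonspecific_signal
    return 100.0 * (1.0 - specific_bound / specific_total)
```

With a total signal of 1000, a background of 200, and a signal of 400 after adding the compound, the function reports that 75% of the specifically bound competitor was displaced.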

GBAP of the present invention or fragments thereof may be used to screen for compounds
that modulate the activity of GBAP. Such compounds may include agonists, antagonists, or partial or
inverse agonists. In one embodiment, an assay is performed under conditions permissive for GBAP
activity, wherein GBAP is combined with at least one test compound, and the activity of GBAP in the
presence of a test compound is compared with the activity of GBAP in the absence of the test
compound. A change in the activity of GBAP in the presence of the test compound is indicative of a
compound that modulates the activity of GBAP. Alternatively, a test compound is combined with an
in vitro or cell-free system comprising GBAP under conditions suitable for GBAP activity, and the
assay is performed. In either of these assays, a test compound which modulates the activity of GBAP
may do so indirectly and need not come in direct contact with GBAP. At least one and up
to a plurality of test compounds may be screened.
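
The comparison described above, activity in the presence of a test compound against activity in its absence, can be sketched in Python as follows. This is an illustrative sketch only: the function name classify_modulation and the 20% threshold for calling a change are assumptions and do not come from this description.

```python
def classify_modulation(activity_without: float,
                        activity_with: float,
                        threshold: float = 0.2) -> str:
    """Compare GBAP activity with and without a test compound.

    threshold is an assumed minimum relative change (here 20%) below which
    the compound is treated as having no effect.
    """
    if activity_without <= 0:
        raise ValueError("baseline activity must be positive")
    relative_change = (activity_with - activity_without) / activity_without
    if relative_change >= threshold:
        return "increases activity"
    if relative_change <= -threshold:
        return "decreases activity"
    return "no change"
```

A compound that raises activity from 100 to 150 units is classified as increasing activity, one that lowers it to 60 as decreasing activity, and one that yields 105 as producing no change under the assumed threshold.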

In another embodiment, polynucleotides encoding GBAP or their mammalian homologs may
be "knocked out" in an animal model system using homologous recombination in embryonic stem
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R.
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host
genome by homologous recombination. Alternatively, homologous recombination takes place using
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell
blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred
to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential
therapeutic or toxic agents.

Polynucleotides encoding GBAP may also be manipulated in vitro in ES cells derived from
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al.
(1998) Science 282:1145-1147).

Polynucleotides encoding GBAP can also be used to create "knockin" humanized animals
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a
region of a polynucleotide encoding GBAP is injected into animal ES cells, and the injected sequence
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and
treated with potential pharmaceutical agents to obtain information on treatment of a human disease.
Alternatively, a mammal inbred to overexpress GBAP, e.g., by secreting GBAP in its milk, may also
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74).
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists
between regions of GBAP and GTP-binding associated proteins. In addition, the expression of GBAP
is closely associated with reproductive tissues, inflammation and the immune response, trauma, cell
proliferation, and cancer. Therefore, GBAP appears to play a role in immune system, reproductive,
nervous system, and cell signaling disorders, and cell proliferative disorders including cancer. In the
treatment of disorders associated with increased GBAP expression or activity, it is desirable to
decrease the expression or activity of GBAP. In the treatment of disorders associated with decreased
GBAP expression or activity, it is desirable to increase the expression or activity of GBAP.

Therefore, in one embodiment, GBAP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of GBAP. Examples of such disorders include, but are not limited to, an immune system 
disorder such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS),
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis,
anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic 
dermatitis, dermatomyositis, diabetes mellitus, emphysema, erythroblastosis fetalis, erythema 
nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease,
Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable
bowel syndrome, episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease
(MCTD), multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation,
myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis,
Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjögren's syndrome, systemic anaphylaxis,
systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic
purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and
extracorporeal circulation, trauma, and hematopoietic cancer including lymphoma, leukemia, and 
myeloma; a reproductive disorder such as a disorder of prolactin production, infertility, including 
tubal disease, ovulatory defects, and endometriosis, a disruption of the estrous cycle, a disruption of 
the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an endometrial
or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and teratogenesis, 
cancer of the breast, fibrocystic breast disease, and galactorrhea, a disruption of spermatogenesis, 
abnormal sperm physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, 
prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; a nervous 
system disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms,
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous
system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, akathisia, amnesia, catatonia, diabetic neuropathy,
tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a
cell signaling disorder including endocrine disorders such as disorders of the hypothalamus and 
pituitary resulting from lesions such as primary brain tumors, adenomas, infarction associated with 
pregnancy, hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, 
immunological disorders, and complications due to head trauma; disorders associated with 
hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone
(ADH) secretion (SIADH) often caused by benign adenoma; disorders associated with
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection;
disorders associated with hyperparathyroidism including Conn disease (chronic hypercalcemia);
pancreatic disorders such as Type I or Type II diabetes mellitus and associated complications; 
disorders associated with the adrenals such as hyperplasia, carcinoma, or adenoma of the adrenal 
cortex, hypertension associated with alkalosis; disorders associated with gonadal steroid hormones
such as: in women, abnormal prolactin production, infertility, endometriosis, perturbations of the
menstrual cycle, polycystic ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency,
amenorrhea, galactorrhea, hermaphroditism, hirsutism and virilization, breast cancer, and, in
post-menopausal women, osteoporosis; and, in men, Leydig cell deficiency, male climacteric phase, and
germinal cell aplasia, hypergonadal disorders associated with Leydig cell tumors, androgen resistance
associated with absence of androgen receptors, syndrome of 5 α-reductase, and gynecomastia; and a
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing GBAP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased
expression or activity of GBAP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
GBAP in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or 
prevent a disorder associated with decreased expression or activity of GBAP including, but not limited
to, those provided above.

In still another embodiment, an agonist which modulates the activity of GBAP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of GBAP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of GBAP may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of GBAP. Examples of such
disorders include, but are not limited to, those immune system, reproductive, nervous system, and cell 
signaling disorders, and cell proliferative disorders including cancer, described above. In one aspect, an 
antibody which specifically binds GBAP may be used directly as an antagonist or indirectly as a 
targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express
GBAP.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding GBAP may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of GBAP including, but not limited to, those described above.

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary
sequences, or vectors of the invention may be administered in combination with other appropriate
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination
of therapeutic agents may act synergistically to effect the treatment or prevention of the various
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with
lower dosages of each agent, thus reducing the potential for adverse side effects.

An antagonist of GBAP may be produced using methods which are generally known in the art.
In particular, purified GBAP may be used to produce antibodies or to screen libraries of pharmaceutical
agents to identify those which specifically bind GBAP. Antibodies to GBAP may also be generated
using methods that are well known in the art. Such antibodies may include, but are not limited to,
polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced
by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are
generally preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans,
and others may be immunized by injection with GBAP or with any fragment or oligopeptide thereof
which has immunogenic properties. Depending on the host species, various adjuvants may be used to
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG
(bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to GBAP
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are
identical to a portion of the amino acid sequence of the natural protein. Short stretches of GBAP amino
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule
may be produced.

Monoclonal antibodies to GBAP may be prepared using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but are not limited
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda,
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single
chain antibodies may be adapted, using methods known in the art, to produce GBAP-specific single
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton,
D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for GBAP may also be generated. For 
example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab')2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between GBAP and its specific 
antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering GBAP epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for GBAP. Affinity is expressed as an association 
constant, Ka, which is defined as the molar concentration of GBAP-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 
GBAP epitopes, represents the average affinity, or avidity, of the antibodies for GBAP. The Ka 
determined for a preparation of monoclonal antibodies, which are monospecific for a particular GBAP 
epitope, represents a true measure of affinity. High-affinity antibody preparations with Ka ranging from 
about 10^9 to 10^12 L/mole are preferred for use in immunoassays in which the GBAP-antibody complex 
must withstand rigorous manipulations. Low-affinity antibody preparations with Ka ranging from 
about 10^6 to 10^7 L/mole are preferred for use in immunopurification and similar procedures which 
ultimately require dissociation of GBAP, preferably in active form, from the antibody (Catty, D. (1988) 
Antibodies, Volume I: A Practical Approach, IRL Press, Washington DC; Liddell, J.E. and A. Cryer 
(1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY). 
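
By way of illustration only, the Scatchard analysis mentioned above can be sketched as a straight-line fit. The sketch assumes a single class of binding sites, for which the ratio of bound to free antigen (B/F) plotted against bound antigen (B) gives a line of slope -Ka and x-intercept Bmax. The function names, the simulated data, and the concentrations are hypothetical and are not taken from the cited references.

```python
def scatchard_fit(bound, free):
    """Fit a line through (B, B/F) points; return (Ka, Bmax).

    Assumes one class of binding sites, so that B/F = Ka * (Bmax - B).
    Concentrations are in mol/L, so Ka is in L/mol.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # slope of the Scatchard line is -Ka
    intercept = mean_y - slope * mean_x  # y-intercept is Ka * Bmax
    ka = -slope
    bmax = intercept / ka
    return ka, bmax


def simulate_binding(ka, bmax, free):
    """Ideal single-site binding curve, used only to generate example data."""
    return [bmax * ka * f / (1 + ka * f) for f in free]


# Hypothetical free-antigen concentrations (mol/L) for a high-affinity antibody.
free = [1e-10, 3e-10, 1e-9, 3e-9, 1e-8]
bound = simulate_binding(ka=1e9, bmax=2e-9, free=free)
ka_est, bmax_est = scatchard_fit(bound, free)
```

For a polyclonal preparation the Scatchard plot is generally curved rather than linear, so a single fitted slope of this kind would reflect only an average affinity, consistent with the distinction drawn above.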

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of GBAP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra, and 
Coligan et al., supra.) 

In another embodiment of the invention, the polynucleotides encoding GBAP, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding GBAP. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding GBAP. 
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press Inc., Totowa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(1):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(11):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding GBAP may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In the 
case where a genetic deficiency in GBAP expression or regulation causes disease, the expression of 
GBAP from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in GBAP 
are treated by constructing mammalian expression vectors encoding GBAP and introducing these 
vectors by mechanical means into GBAP-deficient cells. Mechanical transfer technologies for use with 
cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold 
particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the 
use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; 
Ivics, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Récipon (1998) Curr. Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of GBAP include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). GBAP may be expressed 
using (i) a constitutively active promoter (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding GBAP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to GBAP expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding GBAP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 
(1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding GBAP to cells which have one or more genetic abnormalities with respect to 
the expression of GBAP. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 
versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. 
Rev. Nutr. 19:511-544; and Verma, I.M. and N. Somia (1997) Nature 389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based gene therapy delivery system is used to deliver 
polynucleotides encoding GBAP to target cells which have one or more genetic abnormalities with 
respect to the expression of GBAP. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing GBAP to cells of the central nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev. 
Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus following the transfection of multiple plasmids 
containing different segments of the large herpesvirus genomes, the growth and propagation of 
herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary 
skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding GBAP to target cells. The biology of the prototype alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 
the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During alphavirus 
RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This 
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the 
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease 
and polymerase). Similarly, inserting the coding sequence for GBAP into the alphavirus genome in 
place of the capsid-coding region results in the production of a large number of GBAP-coding RNAs 
and the synthesis of high levels of GBAP in vector transduced cells. While alphavirus infection is 
typically associated with cell lysis within a few days, the ability to establish a persistent infection in 
hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic 
replication of alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S.A. 
et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of 
GBAP into a variety of cell types. The specific transduction of a subset of cells in a population may 
require the sorting of cells prior to transduction. The methods of manipulating infectious cDNA clones 
of alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus 
infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding GBAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
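
By way of illustration only, the initial scan for candidate cleavage sites described above can be sketched as follows. The sketch reports each GUA, GUU, or GUC triplet together with a surrounding window of 15 to 20 ribonucleotides; it does not evaluate secondary structure or accessibility. The function name, the window-centering rule, and the example sequence are hypothetical.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna, window=20):
    """Return (position, triplet, region) for each candidate cleavage site.

    position is the 0-based index of the G in the triplet; region is a
    stretch of up to `window` ribonucleotides containing the triplet.
    Regions shorter than 15 ribonucleotides are skipped.
    """
    rna = rna.upper().replace("T", "U")  # accept cDNA-style input as well
    sites = []
    for i in range(len(rna) - 2):
        triplet = rna[i:i + 3]
        if triplet not in CLEAVAGE_TRIPLETS:
            continue
        flank = (window - 3) // 2            # roughly center the triplet
        start = max(0, i - flank)
        end = min(len(rna), start + window)
        start = max(0, end - window)         # shift left near the 3' end
        region = rna[start:end]
        if len(region) >= 15:
            sites.append((i, triplet, region))
    return sites


# Hypothetical target fragment containing a single GUC site.
example = "AAAAAAAAAAGUCAAAAAAAAAAAA"
candidates = find_cleavage_sites(example)
```

Each reported region would then be a candidate for the structural and ribonuclease protection evaluations described above.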

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding GBAP. Such DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or 
tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding GBAP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased GBAP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding GBAP may be therapeutically useful, and in the treatment of disorders associated with 
decreased GBAP expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding GBAP may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding GBAP is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding GBAP are assayed 
by any method commonly known in the art. Typically, the expression of a specific polynucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding GBAP. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 
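
By way of illustration only, the comparison of quantified hybridization with and without exposure to a test compound can be sketched as a fold-change calculation. The function name, the replicate signal values, and the two-fold cutoff are hypothetical; the specification does not prescribe a threshold, and a practical screen would also apply an appropriate statistical test.

```python
from statistics import mean


def classify_compound(treated_signals, untreated_signals, fold_threshold=2.0):
    """Compare mean hybridization signal with and without a test compound.

    Returns (fold_change, label). The label is "promoter" when expression
    rises by at least fold_threshold, "inhibitor" when it falls by at least
    the same factor, and "no clear effect" otherwise.
    """
    fold = mean(treated_signals) / mean(untreated_signals)
    if fold >= fold_threshold:
        return fold, "promoter"
    if fold <= 1 / fold_threshold:
        return fold, "inhibitor"
    return fold, "no clear effect"


# Hypothetical replicate signals, in arbitrary units.
fold, label = classify_compound([48, 52, 50], [195, 205, 200])
```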

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutically 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest edition 
of Remington's Pharmaceutical Sciences (Mack Publishing, Easton PA). Such pharmaceutical 
compositions may consist of GBAP, antibodies to GBAP, and mimetics, agonists, antagonists, or 
inhibitors of GBAP. 

The pharmaceutical compositions utilized in this invention may be administered by any number 
of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, 
topical, sublingual, or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g., traditional low molecular weight organic drugs), aerosol 
delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g., larger 
peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar region of 
the lung have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., 
Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of administration 
without needle injection, and obviates the need for potentially toxic penetration enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein the 
active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
delivery of macromolecules comprising GBAP or fragments thereof. For example, liposome 
preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, GBAP or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 
found to transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, 
S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example GBAP 
or fragments thereof, antibodies of GBAP, and agonists, antagonists or inhibitors of GBAP, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD50/ED50 ratio. Pharmaceutical compositions which 
exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in such 
compositions is preferably within a range of circulating concentrations that includes the ED50 with little 
or no toxicity. The dosage varies within this range depending upon the dosage form employed, the 
sensitivity of the patient, and the route of administration. 
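
By way of illustration only, the therapeutic index described above can be sketched numerically. The sketch estimates the 50% dose by linear interpolation on a log10 dose scale between the two tested doses that bracket a 50% response, which is a simplifying assumption; fitted dose-response models are more commonly used in practice. The function names and the dose-response values are hypothetical.

```python
import math


def dose_at_half_response(doses, fractions):
    """Interpolate, on a log10 dose scale, the dose giving a 50% response.

    doses are positive numbers; fractions are the responding fraction of
    the population (0 to 1) observed at each dose.
    """
    points = sorted(zip(doses, fractions))
    for (d_lo, f_lo), (d_hi, f_hi) in zip(points, points[1:]):
        if f_lo <= 0.5 <= f_hi and f_hi > f_lo:
            t = (0.5 - f_lo) / (f_hi - f_lo)
            log_dose = math.log10(d_lo) + t * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_dose
    raise ValueError("responses do not bracket 50%")


def therapeutic_index(ld50, ed50):
    """Dose ratio of toxic to therapeutic effects, LD50/ED50."""
    return ld50 / ed50


# Hypothetical animal-study data (doses in arbitrary units).
ed50 = dose_at_half_response([1, 10, 100], [0.1, 0.5, 0.9])
ld50 = dose_at_half_response([100, 1000, 10000], [0.0, 0.5, 1.0])
index = therapeutic_index(ld50, ed50)
```

A larger value of the index indicates a wider margin between the therapeutically effective dose and the lethal dose.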

The exact dosage will be determined by the practitioner, in light of factors related to the subject 
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind GBAP may be used for the diagnosis 
of disorders characterized by expression of GBAP, or in assays to monitor patients being treated with 
GBAP or agonists, antagonists, or inhibitors of GBAP. Antibodies useful for diagnostic purposes may 
be prepared in the same manner as described above for therapeutics. Diagnostic assays for GBAP 
include methods which utilize the antibody and a label to detect GBAP in human body fluids or in 
extracts of cells or tissues. The antibodies may be used with or without modification, and may be 
labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter 
molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring GBAP, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of GBAP expression. Normal 
or standard values for GBAP expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibody to GBAP under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of GBAP expressed in subject, 
control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding GBAP may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of GBAP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
GBAP, and to monitor regulation of GBAP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding GBAP or closely related molecules may be used to 
identify nucleic acid sequences which encode GBAP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding GBAP, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50%
sequence identity to any of the GBAP encoding sequences. The hybridization probes of the subject
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:67-132 or from
genomic sequences including promoters, enhancers, and introns of the GBAP gene.
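
As a rough illustration of the 50% sequence identity criterion, the following Python sketch computes percent identity between a candidate probe and a target sequence. It assumes the two sequences have already been aligned to equal length, with gaps written as "-". The function name percent_identity, the example sequences, and the convention of counting gapped columns as mismatches are illustrative assumptions and are not taken from the disclosure.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Both inputs must have the same length; '-' marks a gap. Columns in
    which either sequence has a gap are counted as mismatches (an
    assumption; other conventions exclude gapped columns).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical aligned sequences, for illustration only.
probe = "ATGGCC-TTAGC"
target = "ATGACCGTTAGA"
identity = percent_identity(probe, target)
meets_threshold = identity >= 50.0  # the "at least 50%" criterion
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch covers only the final identity calculation.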

Means for producing specific hybridization probes for DNAs encoding GBAP include the
cloning of polynucleotide sequences encoding GBAP or GBAP derivatives into vectors for the
production of mRNA probes. Such vectors are known in the art, are commercially available, and may
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety
of reporter groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as
alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like.

Polynucleotide sequences encoding GBAP may be used for the diagnosis of disorders
associated with expression of GBAP. Examples of such disorders include, but are not limited to, an
immune system disorder such as inflammation, actinic keratosis, acquired immunodeficiency 
syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic 
anemia, autoimmune thyroiditis, bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis,
Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, erythroblastosis
fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, 
Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, 
hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with lymphocytotoxins, mixed 
connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, myocardial or pericardial
inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, polycythemia vera,
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome,
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, trauma, and hematopoietic cancer including lymphoma,
leukemia, and myeloma; a reproductive disorder such as a disorder of prolactin production, infertility,
including tubal disease, ovulatory defects, and endometriosis, a disruption of the estrous cycle, a
disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an
endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and
teratogenesis, cancer of the breast, fibrocystic breast disease, and galactorrhea, a disruption of
spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the prostate, benign
prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and
gynecomastia; a nervous system disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple
sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral
central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis,
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic,
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including
mood, anxiety, and schizophrenic disorders, akathesia, amnesia, catatonia, diabetic neuropathy, 
tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a 
cell signaling disorder including endocrine disorders such as disorders of the hypothalamus and 
pituitary resulting from lesions such as primary brain tumors, adenomas, infarction associated with
pregnancy, hypophysectomy, aneurysms, vascular malformations, thrombosis, infections,
immunological disorders, and complications due to head trauma; disorders associated with 
hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma; disorders associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection;
disorders associated with hypoparathyroidism including Conn disease (chronic hypercalemia);
pancreatic disorders such as Type I or Type II diabetes mellitus and associated complications; 
disorders associated with the adrenals such as hyperplasia, carcinoma, or adenoma of the adrenal 
cortex, hypertension associated with alkalosis; disorders associated with gonadal steroid hormones 
such as: in women, abnormal prolactin production, infertility, endometriosis, perturbations of the
menstrual cycle, polycystic ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency,
amenorrhea, galactorrhea, hermaphroditism, hirsutism and virilization, breast cancer, and, in
post-menopausal women, osteoporosis; and, in men, Leydig cell deficiency, male climacteric phase, and
germinal cell aplasia, hypergonadal disorders associated with Leydig cell tumors, androgen resistance
associated with absence of androgen receptors, syndrome of 5 α-reductase, and gynecomastia; and a
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis,
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid,
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. The polynucleotide
sequences encoding GBAP may be used in Southern or northern analysis, dot blot, or other
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like
assays; and in microarrays utilizing fluids or tissues from patients to detect altered GBAP expression.
Such qualitative or quantitative methods are well known in the art.

In a particular aspect, the nucleotide sequences encoding GBAP may be useful in assays that
detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding GBAP may be labeled by standard methods and added to a fluid or tissue sample
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable
incubation period, the sample is washed and the signal is quantified and compared with a standard
value. If the amount of signal in the patient sample is significantly altered in comparison to a control
sample then the presence of altered levels of nucleotide sequences encoding GBAP in the sample
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the
treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of GBAP,
a normal or standard profile for expression is established. This may be accomplished by combining
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a
fragment thereof, encoding GBAP, under conditions suitable for hybridization or amplification.
Standard hybridization may be quantified by comparing the values obtained from normal subjects with
values from an experiment in which a known amount of a substantially purified polynucleotide is used.
Standard values obtained in this manner may be compared with values obtained from samples from
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the
presence of a disorder.
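
The comparison of a patient value against standard values can be sketched as follows. This is a minimal Python illustration, not the method of the disclosure: the text does not specify how "deviation" is scored, so the use of a z-score, the cutoff of two standard deviations, the function name deviates_from_standard, and all signal values are assumptions chosen for the example.

```python
from statistics import mean, stdev


def deviates_from_standard(normal_values, patient_value, n_sd=2.0):
    """Return (z_score, flagged) for a patient signal against normal values.

    normal_values: hybridization signals from normal subjects.
    n_sd: number of standard deviations treated as a deviation
          (an illustrative cutoff, not specified in the text).
    """
    if len(normal_values) < 2:
        raise ValueError("need at least two normal values")
    mu = mean(normal_values)
    sd = stdev(normal_values)  # sample standard deviation
    if sd == 0:
        raise ValueError("normal values have zero variance")
    z = (patient_value - mu) / sd
    return z, abs(z) > n_sd


# Hypothetical signal values in arbitrary units.
normal_signal = [98.0, 102.0, 100.0, 101.0, 99.0]
z, flagged = deviates_from_standard(normal_signal, 112.0)
```

The same function could be applied to successive samples from one patient to follow whether the signal moves back toward the normal range during treatment.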

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from
successive assays may be used to show the efficacy of treatment over a period ranging from several
days to months.

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development
of the disease, or may provide a means for detecting the disease prior to the appearance of actual
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ
preventative measures or aggressive treatment earlier, thereby preventing the development or further
progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding GBAP
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically,
or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding
GBAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding GBAP, and
will be employed under optimized conditions for identification of a specific gene or condition.
Oligomers may also be employed under less stringent conditions for detection or quantification of
closely related DNA or RNA sequences.

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences
encoding GBAP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers
derived from the polynucleotide sequences encoding GBAP are used to amplify DNA using the
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary
and tertiary structures of PCR products in single-stranded form, and these differences are detectable
using gel electrophoresis in non-denaturing gels. In fSSCP, the oligonucleotide primers are
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual
overlapping DNA fragments which assemble into a common consensus sequence. These computer-based
methods filter out sequence variations due to laboratory preparation of DNA and sequencing
errors using statistical models and automated analyses of DNA sequence chromatograms. In the
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high
throughput MASSARRAY system (Sequenom, Inc., San Diego CA).
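
The idea behind in silico SNP detection, comparing overlapping fragments that assemble into one consensus, can be sketched in Python as shown below. This is only a simplified stand-in: the methods described above rely on statistical models and chromatogram analysis to reject sequencing errors, whereas this sketch uses a plain depth and minor-allele-frequency cutoff. The function name candidate_snps, both thresholds, and the example reads are hypothetical.

```python
from collections import Counter


def candidate_snps(aligned_reads, min_depth=4, min_minor_fraction=0.25):
    """Report alignment columns where a second base is well supported.

    aligned_reads: equal-length strings already placed on the consensus
                   coordinates; '-' marks positions a read does not cover.
    Returns a list of (position, major_base, minor_base, minor_fraction).
    The two thresholds are illustrative stand-ins for a statistical
    error model.
    """
    if not aligned_reads:
        return []
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("all aligned reads must have the same length")

    results = []
    for pos in range(length):
        counts = Counter(r[pos] for r in aligned_reads if r[pos] != "-")
        depth = sum(counts.values())
        if depth < min_depth or len(counts) < 2:
            continue  # too little coverage, or no variation at this column
        (major, _), (minor, minor_n) = counts.most_common(2)
        fraction = minor_n / depth
        if fraction >= min_minor_fraction:
            results.append((pos, major, minor, fraction))
    return results


# Hypothetical overlapping fragments aligned to a common consensus.
reads = [
    "ACGTACGT",
    "ACGTACGT",
    "ACGAACGT",
    "ACGAACGT",
    "ACGTACG-",
    "-CGTACGT",
]
snps = candidate_snps(reads)
```

With these reads, only the fourth column (index 3) shows two bases, so it is the single candidate reported.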

Methods which may also be used to quantify the expression of GBAP include radiolabeling or
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid
quantitation.

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as elements on a microarray. The microarray
can be used in transcript imaging techniques which monitor the relative expression levels of large
numbers of genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be
used to identify genetic variants, mutations, and polymorphisms. This information may be used to
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor
progression/regression of disease as a function of gene expression, and to develop and monitor the
activities of therapeutic agents in the treatment of disease. In particular, this information may be used
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective
treatment regimen for that patient. For example, therapeutic agents which are highly effective and
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile.

In another embodiment, antibodies specific for GBAP, or GBAP or fragments thereof may be
used as elements on a microarray. The microarray may be used to monitor or measure protein-protein
interactions, drug-target interactions, and gene expression profiles, as described above.

A particular embodiment relates to the use of the polynucleotides of the present invention to
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by
quantifying the number of expressed genes and their relative abundance under given conditions and at a
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by
hybridizing the polynucleotides of the present invention or their complements to the totality of
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the
hybridization takes place in high-throughput format, wherein the polynucleotides of the present
invention or their complements comprise a subset of a plurality of elements on a microarray. The
resultant transcript image would provide a profile of gene activity.
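
A transcript image as described above, that is, the number of expressed genes together with their relative abundance, can be represented very simply. The following Python sketch assumes that per-gene signal values (for example, hybridization intensities or clone counts) are already available; the function name transcript_image, the gene names, and the values are hypothetical.

```python
def transcript_image(counts):
    """Summarize expression as number of expressed genes and relative abundance.

    counts: mapping of gene name to a non-negative signal value
            (e.g., hybridization intensity or clone count).
    Genes with a value of zero are treated as not expressed.
    """
    expressed = {gene: value for gene, value in counts.items() if value > 0}
    total = sum(expressed.values())
    if total == 0:
        return {"expressed_genes": 0, "relative_abundance": {}}
    return {
        "expressed_genes": len(expressed),
        "relative_abundance": {g: v / total for g, v in expressed.items()},
    }


# Hypothetical signal values for one tissue sample.
sample_counts = {"gene_A": 30, "gene_B": 10, "gene_C": 0, "gene_D": 60}
image = transcript_image(sample_counts)
```

Images computed this way for two tissues or two conditions can then be compared gene by gene.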

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies,
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile the expression of the polynucleotides of the present invention
may also be used in conjunction with in vitro model systems and preclinical evaluation of
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000)
Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test compound has a
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties.
These fingerprints or signatures are most useful and refined when they contain expression information
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression
provides the highest quality signature. Genes whose expression is not altered by any tested
compound are also important, as the levels of expression of these genes are used to normalize the
rest of the expression data. The normalization procedure is useful for comparison of expression data
after treatment with different compounds. While the assignment of gene function to elements of a
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released
February 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is
important and desirable in toxicological screening using toxicant signatures to include all expressed
gene sequences.
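
The normalization and signature-matching steps described above can be illustrated with a short Python sketch. The text does not prescribe a particular statistic, so the choices here are assumptions: expression changes are expressed as log2 treated/untreated ratios, the ratios are centered on genes assumed to be unaffected by treatment, and similarity between two signatures is scored with a Pearson correlation. All function names, gene names, and intensity values are hypothetical.

```python
from math import log2, sqrt


def normalized_signature(treated, untreated, reference_genes):
    """Log2 treated/untreated ratios, centered on unchanged reference genes.

    treated, untreated: mappings of gene name to a positive intensity.
    reference_genes: genes assumed not to respond to any tested compound.
    """
    ratios = {g: log2(treated[g] / untreated[g]) for g in treated}
    offset = sum(ratios[g] for g in reference_genes) / len(reference_genes)
    return {g: r - offset for g, r in ratios.items()}


def pearson(sig_a, sig_b):
    """Pearson correlation of two signatures over their shared genes."""
    genes = sorted(set(sig_a) & set(sig_b))
    xs = [sig_a[g] for g in genes]
    ys = [sig_b[g] for g in genes]
    n = len(genes)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("a signature with no variation cannot be correlated")
    return cov / sqrt(vx * vy)


# Hypothetical intensities; "ref" stands in for an unchanged gene.
untreated = {"g1": 100, "g2": 100, "g3": 100, "ref": 100}
test_compound = {"g1": 400, "g2": 50, "g3": 200, "ref": 200}
known_toxicant = {"g1": 200, "g2": 25, "g3": 100, "ref": 100}

sig_test = normalized_signature(test_compound, untreated, ["ref"])
sig_known = normalized_signature(known_toxicant, untreated, ["ref"])
similarity = pearson(sig_test, sig_known)
```

In this example the test compound's raw intensities are uniformly twice those of the known toxicant; centering on the reference gene removes that offset, so the two signatures match.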

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present
invention may be quantified. The transcript levels in the treated biological sample are compared with
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by
isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is
generally proportional to the level of the protein in the sample. The optical densities of equivalently


WO 01/05970 PCT/US00/19698 
positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein
spot density related to the treatment. The proteins in the spots are partially sequenced using, for
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry.
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of
at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 
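
The comparison of a partial sequence of at least 5 contiguous residues against known polypeptide sequences can be sketched as an exact substring search. The Python example below is a minimal illustration only: real identification would normally tolerate sequencing ambiguity, which is not modeled here, and the function name match_partial_sequence and both reference sequences are hypothetical placeholders rather than sequences of the invention.

```python
def match_partial_sequence(partial, polypeptides, min_length=5):
    """Return names of polypeptides that contain the partial sequence exactly.

    partial: contiguous amino acid residues in one-letter code.
    polypeptides: mapping of name to full one-letter sequence.
    min_length: shortest partial sequence accepted (5 residues, following
                the preference stated in the text).
    """
    partial = partial.upper()
    if len(partial) < min_length:
        raise ValueError(f"partial sequence must have at least {min_length} residues")
    return [
        name for name, sequence in polypeptides.items()
        if partial in sequence.upper()
    ]


# Hypothetical reference sequences, for illustration only.
reference = {
    "polypeptide_1": "MKTAYIAKQRQISFVKSHFSRQ",
    "polypeptide_2": "MGSSHHHHHHSSGLVPRGSH",
}
hits = match_partial_sequence("AKQRQ", reference)
```

A partial sequence that matches more than one entry would call for further sequence data before a definitive identification is made.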

A proteomic profile may also be generated using antibodies specific for GBAP to quantify the
levels of GBAP expression. In one embodiment, the antibodies are used as elements on a microarray,
and protein expression levels are quantified by exposing the microarray to the sample and detecting the
levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-111;
Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino-
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element.

Toxicant signatures at the proteome level are also useful for toxicological screening, and should
be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J.
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the
analysis of compounds which do not significantly affect the transcript image, but which alter the
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins that are expressed in the treated biological
sample are separated so that the amount of each protein can be quantified. The amount of each protein
is compared to the amount of the corresponding protein in an untreated biological sample. A difference
in the amount of protein between the two samples is indicative of a toxic response to the test compound
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the
individual proteins and comparing these partial sequences to the polypeptides of the present invention.

In another embodiment, the toxicity of a test compound is assessed by treating a biological
sample containing proteins with the test compound. Proteins from the biological sample are incubated
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with
the amount in an untreated biological sample. A difference in the amount of protein between the two
samples is indicative of a toxic response to the test compound in the treated sample.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well
known and thoroughly described in DNA Microarrays: A Practical Approach, M. Schena, ed. (1999)
Oxford University Press, London, hereby expressly incorporated by reference.

In another embodiment of the invention, nucleic acid sequences encoding GBAP may be used to
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be
preferable over coding sequences. For example, conservation of a coding sequence among members
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs),
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat.
Genet. 15:345-355; Price, C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet.
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g.,
Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map
data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic map
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM)
World Wide Web site. Correlation between the location of the gene encoding GBAP on a physical map
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA
associated with that disorder and thus may further positional cloning efforts.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may
reveal associated markers even if the exact chromosomal locus is not known. This information is
valuable to investigators searching for disease genes using positional cloning or other gene discovery
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g.,
Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may
also be used to detect differences in the chromosomal location due to translocation, inversion, etc.,
among normal, carrier, or affected individuals.

In another embodiment of the invention, GBAP, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between GBAP and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with GBAP, or fragments thereof, and
washed. Bound GBAP is then detected by methods well known in the art. Purified GBAP can also be
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively,
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing
antibodies capable of binding GBAP specifically compete with a test compound for binding GBAP. In
this manner, antibodies can be used to detect the presence of any peptide which shares one or more
antigenic determinants with GBAP.

In additional embodiments, the nucleotide sequences which encode GBAP may be used in any
molecular biology techniques that have yet to be developed, provided the new techniques rely on
properties of nucleotide sequences that are currently known, including, but not limited to, such
properties as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding
description, utilize the present invention to its fullest extent. The following preferred specific
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder
of the disclosure in any way whatsoever.

The disclosures of all patents, applications and publications, mentioned above and below, in
particular U.S. Ser. No. 60/144,595, U.S. Ser. No. 60/150,460, and U.S. Ser. No. 60/159,849, are
hereby expressly incorporated by reference.


EXAMPLES 

I. Construction of cDNA Libraries

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl
cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol
or sodium acetate and ethanol, or by other routine methods.

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN,
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.,
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2.1 plasmid
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant
plasmids were transformed into competent E. coli cells including XL1-Blue, XL1-BlueMRF, or SOLR
from Stratagene or DH5α, DH10B, or ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones 
Plasmids obtained as described in Example I were recovered from host cells by in vivo excision
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least
one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of
distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner
(Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis 

Incyte cDNAs recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation
such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ
Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic
separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out
using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling
software; or other sequence analysis systems known in the art. Reading frames within the cDNA
sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some
of the cDNA sequences were selected for extension using the techniques disclosed in Example VI.

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed
using a combination of software programs which utilize algorithms well known to those skilled in the
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions,
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and
algorithms used, the second column provides brief descriptions thereof, the third column presents
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth
column presents, where applicable, the scores, probability values, and other parameters used to evaluate
the strength of a match between two sequences (the higher the score, the greater the homology between
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and
polypeptide sequence alignments were generated using the default parameters specified by the clustal
algorithm as incorporated into the MEGALIGN multisequence alignment program (DNASTAR), which
also calculates the percent identity between aligned sequences.

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic
programming, and dinucleotide nearest neighbor analysis. The sequences were then queried against a
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into full
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened
for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length
polynucleotide sequences were translated to derive the corresponding full length amino acid sequences,
and these full length sequences were subsequently analyzed by querying against databases such as the
GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite,
and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a
probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g.,
Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.)

The programs described above for the assembly and analysis of full length polynucleotide and
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID
NO:67-132. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies were described in The Invention section above.
IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene
and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 1995,
supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer
search can be modified to determine whether any particular match is categorized as exact or similar.
The basis of the search is the product score, which is defined as:

$$\text{Product score} = \frac{\text{BLAST Score} \times \text{Percent Identity}}{5 \times \min\{\text{length(Seq. 1)},\ \text{length(Seq. 2)}\}}$$

The product score takes into account both the degree of similarity between two sequences and the length
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The
product score represents a balance between fractional overlap and quality in a BLAST alignment. For
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter
of the two sequences being compared. A product score of 70 is produced either by 100% identity and
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap.
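
The product score calculation described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the software actually used; the function names are hypothetical, and the alignment is assumed to be a single ungapped HSP described by its counts of matching and mismatching bases.

```python
def blast_score(matches: int, mismatches: int) -> int:
    """Score one HSP: +5 for every matching base, -4 for every mismatch."""
    return 5 * matches - 4 * mismatches


def product_score(matches: int, mismatches: int,
                  len_seq1: int, len_seq2: int) -> float:
    """Product score for the highest-scoring HSP between two sequences.

    Assumes an ungapped HSP, so percent identity is computed over the
    aligned positions (matches + mismatches).
    """
    aligned = matches + mismatches
    if aligned == 0:
        return 0.0
    percent_identity = 100.0 * matches / aligned
    shorter = min(len_seq1, len_seq2)
    return blast_score(matches, mismatches) * percent_identity / (5 * shorter)


if __name__ == "__main__":
    # 100% identity over the full length of a 100-base shorter sequence
    print(product_score(100, 0, 100, 250))
    # 100% identity over 70% of the shorter sequence
    print(product_score(70, 0, 100, 250))
    # 88% identity over the full length of the shorter sequence
    print(round(product_score(88, 12, 100, 250), 1))
```

Under these assumptions the first two cases evaluate to exactly 100 and 70, and the 88% identity case evaluates to about 69, consistent with the rounded figure of 70 given in the text.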

The results of northern analyses are reported as a percentage distribution of libraries in which
the transcript encoding GBAP occurred. Analysis involved the categorization of cDNA libraries by
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic,
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous,
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma,
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the
sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 3.
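
The percentage distribution described above might be computed as in the following sketch. The function name and input layout are hypothetical. The denominator is assumed to be the total number of libraries, across all categories, in which the transcript was found, so that the percentages sum to 100; the text could also be read as dividing by all libraries surveyed.

```python
from collections import Counter


def category_distribution(expressing_library_categories):
    """Percentage of expressing libraries that fall in each category.

    expressing_library_categories: one category label per library in
    which the transcript of interest was detected (assumed input format).
    """
    counts = Counter(expressing_library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}


if __name__ == "__main__":
    libraries = ["nervous", "nervous", "reproductive", "cardiovascular"]
    for category, pct in sorted(category_distribution(libraries).items()):
        print(f"{category}: {pct:.1f}%")
```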

V. Chromosomal Mapping of GBAP Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:67-132 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:67-132 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.
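
The assignment rule in the preceding paragraph, in which a mapped member lends its location to every sequence in its cluster, can be sketched as follows. The data structures, identifiers, and function name are hypothetical and chosen only for illustration.

```python
def assign_map_locations(clusters, mapped):
    """Propagate known map locations to every member of a cluster.

    clusters: dict of cluster id -> list of sequence ids (assumed layout)
    mapped:   dict of sequence id -> previously reported map location
    Returns a dict of sequence id -> sorted list of assigned locations.
    A cluster containing several differently mapped sequences yields
    more than one location for each of its members.
    """
    assignments = {}
    for members in clusters.values():
        locations = sorted({mapped[s] for s in members if s in mapped})
        if not locations:
            continue  # no mapped sequence in this cluster
        for seq_id in members:
            assignments[seq_id] = list(locations)
    return assignments


if __name__ == "__main__":
    clusters = {"c1": ["SEQ70", "est_a", "est_b"], "c2": ["SEQ71", "est_c"]}
    mapped = {"est_a": "chr1:10.0-12.5 cM"}
    print(assign_map_locations(clusters, mapped))
```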

The genetic map locations of SEQ ID NO:70, 74, 75, 77, 80, 86, 87, 90, 92, 93, 94, 97, 101,
106, 109, 111, 112, 113, 115, 117, 118, 121, and 128 are described in The Invention as ranges, or
intervals, of human chromosomes. More than one map location is reported for SEQ ID NO:94, 101,
109, 111, and 115, indicating that previously mapped sequences having similarity, but not complete
identity, to SEQ ID NO:94, 101, 109, 111, and 115 were assembled into their respective clusters.
The map position of an interval, in centiMorgans, is measured relative to the terminus of the
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.)
The cM distances are based on genetic markers mapped by Généthon which provide boundaries for
radiation hybrid markers whose sequences were included in each of the clusters.

VI. Extension of GBAP Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:67-132 were produced by extension of an
appropriate fragment of the full length molecule using oligonucleotide primers designed from this
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other
primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at
temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided.
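
As an illustration of the length and GC-content criteria stated above, the following sketch screens a candidate primer. It is not the OLIGO 4.06 algorithm; the function names and example sequences are hypothetical, and the annealing temperature, hairpin, and primer-dimer criteria are deliberately not modeled.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer sequence."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def passes_basic_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check only primer length (about 22 to 30 nt) and GC content (>= 50%)."""
    if not primer:
        return False
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    for candidate in ("GCGCATGCATGCGCATGCATGCGC", "ATATATATATATATATATATATAT"):
        print(candidate, passes_basic_criteria(candidate))
```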

Selected human cDNA libraries were used to extend the sequence. If more than one extension
was necessary or desired, additional or nested sets of primers were designed.

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg²⁺, (NH₄)₂SO₄,
and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C,
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
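
The two cycling programs above differ only in the annealing temperature of Step 3, which is easy to see when they are written as data. The sketch below uses a hypothetical dictionary layout and function name, ignores ramp times, and treats "repeated 20 times" as 20 total passes through Steps 2 to 4, which is an assumption (the phrase could also mean 21 passes).

```python
def make_program(anneal_temp_c: int) -> dict:
    """Build a cycling program as (temperature in deg C, hold in seconds) steps."""
    return {
        "initial": (94, 180),                      # Step 1
        "cycle": [(94, 15), (anneal_temp_c, 60), (68, 120)],  # Steps 2-4
        "passes": 20,                              # Step 5 (assumed total passes)
        "final": (68, 300),                        # Step 6
        "storage_temp_c": 4,                       # Step 7
    }


def programmed_hold_seconds(program: dict) -> int:
    """Sum of programmed hold times, excluding ramping and 4 deg C storage."""
    per_pass = sum(seconds for _, seconds in program["cycle"])
    return program["initial"][1] + program["passes"] * per_pass + program["final"][1]


PROGRAM_PCI = make_program(anneal_temp_c=60)    # primer pair PCI A and PCI B
PROGRAM_T7_SK = make_program(anneal_temp_c=57)  # primer pair T7 and SK+

if __name__ == "__main__":
    print(programmed_hold_seconds(PROGRAM_PCI) / 60, "minutes")
```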

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by electrophoresis
on a 1% agarose mini-gel to determine which reactions were successful in extending the sequence.
The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs,
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing
media, and individual colonies were picked and cultured overnight at 37°C in 384-well plates in LB/2x
carb liquid media.
The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1:
94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified
using the same conditions as described above. Samples were diluted with 20% dimethylsulfoxide (1:2,
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle
sequencing ready reaction kit (PE Biosystems).

In like manner, the polynucleotide sequences of SEQ ID NO:67-132 are used to obtain 5'
regulatory sequences using the procedure above, along with oligonucleotides designed for such
extension, and an appropriate genomic library.

VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:67-132 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide fragments.
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of [γ-³²P] adenosine
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10⁷ counts per
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or
Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler, supra), mechanical
microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned
technologies should be uniform and solid with a non-porous surface (Schena (1999), supra). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be
produced using available methods and machines well known to those of ordinary skill in the art and may
contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470;
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol.
16:27-31.)

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The array
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection.
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in
detail below.

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X first
strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM
dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse
transcription reaction is performed in a 25 µl volume containing 200 ng poly(A)+ RNA with
GEMBRIGHT kits (Incyte). Specific control poly(A)+ RNAs are synthesized by in vitro transcription
from non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one
with Cy3 and another with Cy5 labeling) is treated with 2.5 µl of 0.5M sodium hydroxide and
incubated for 20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc.
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated
using 1 µl of glycogen (1 mg/ml), 60 µl sodium acetate, and 300 µl of 100% ethanol. The sample is
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and
resuspended in 14 µl 5X SSC/0.2% SDS.

Microarray Preparation
Sequences of the present invention are used to generate array elements. Each array element is
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses
primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5
ng. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia
Biotech).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C
oven.



Array elements are applied to the coated glass substrate using a procedure described in US
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water.
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in
0.2% SDS and distilled water as before.
Hybridization 

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with
a 1.8 cm² coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of
140 µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for
about 6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (1X SSC,
0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried.



Detection 



Reporter-labeled hybridization complexes are detected with a microscope equipped with an
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and
raster-scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a
resolution of 20 micrometers.

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source,
although the apparatus is capable of recording the spectra from both fluorophores simultaneously.

The sensitivity of the scans is typically calibrated using the signal intensity generated by a
cDNA control species added to the sample mixture at a known concentration. A specific location on
the array contains a complementary DNA sequence, allowing the intensity of the signal at that
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples
from different sources (e.g., representing test and control cells), each labeled with a different
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two
fluorophores and adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RH-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping
emission spectra) between the fluorophores using each fluorophore's emission spectrum.
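
A linear 20-color transformation of 12-bit data, as described above, amounts to dividing the digitizer range into 20 equal bins. The sketch below is one plausible way to do this and is not taken from the imaging software; the function name is hypothetical, and bin 0 is assumed to correspond to the blue (low signal) end and bin 19 to the red (high signal) end.

```python
def pseudocolor_bin(adc_value: int, bits: int = 12, n_colors: int = 20) -> int:
    """Map a digitized intensity linearly onto one of n_colors bins.

    Returns an index from 0 (lowest signal) to n_colors - 1 (highest).
    """
    full_scale = 1 << bits  # 4096 levels for a 12-bit converter
    if not 0 <= adc_value < full_scale:
        raise ValueError("value outside the converter range")
    return adc_value * n_colors // full_scale


if __name__ == "__main__":
    for value in (0, 2048, 4095):
        print(value, pseudocolor_bin(value))
```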

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
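
The grid-based quantitation described above can be illustrated with a short sketch. It is not the GEMTOOLS implementation; the function name is hypothetical, the image is assumed to be a rectangular list of pixel rows, and the grid is assumed to divide the image evenly so that each spot sits in one grid element.

```python
def grid_average_intensities(image, rows: int, cols: int):
    """Average pixel intensity within each element of a rows x cols grid.

    image: list of equal-length lists of pixel values (assumed layout).
    Returns a rows x cols nested list of average intensities.
    """
    height, width = len(image), len(image[0])
    cell_h, cell_w = height // rows, width // cols
    averages = []
    for r in range(rows):
        row_values = []
        for c in range(cols):
            pixels = [
                image[y][x]
                for y in range(r * cell_h, (r + 1) * cell_h)
                for x in range(c * cell_w, (c + 1) * cell_w)
            ]
            row_values.append(sum(pixels) / len(pixels))
        averages.append(row_values)
    return averages


if __name__ == "__main__":
    image = [[1, 3, 10, 10],
             [1, 3, 20, 20]]
    print(grid_average_intensities(image, rows=1, cols=2))
```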

IX. Complementary Polynucleotides

Sequences complementary to the GBAP-encoding sequences, or any parts thereof, are used to
detect, decrease, or inhibit expression of naturally occurring GBAP. Although use of oligonucleotides
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO
4.06 software (National Biosciences) and the coding sequence of GBAP. To inhibit transcription, a
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is
designed to prevent ribosomal binding to the GBAP-encoding transcript.
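
Deriving a complementary oligonucleotide from a stretch of coding sequence is a reverse-complement operation, sketched below. The function names and the example sequence are hypothetical, and the sketch makes no attempt to choose the "most unique" region or to model the design criteria applied by OLIGO 4.06.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_sequence: str, start: int, length: int = 20) -> str:
    """Oligonucleotide complementary to coding_sequence[start:start + length]."""
    target = coding_sequence[start:start + length]
    if len(target) < length:
        raise ValueError("requested window runs past the end of the sequence")
    return reverse_complement(target)


if __name__ == "__main__":
    # Illustrative sequence only, not a GBAP sequence
    print(antisense_oligo("ATGGCCAAGTTGACCGGTAAC", start=0, length=15))
```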
X. Expression of GBAP

Expression and purification of GBAP is achieved using bacterial or virus-based expression
systems. For expression of GBAP in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express GBAP upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of GBAP in eukaryotic cells is achieved by infecting insect
or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding GBAP by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945.)

In most expression systems, GBAP is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia
Biotech). Following purification, the GST moiety can be proteolytically cleaved from GBAP at
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak).
6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra,
ch. 10 and 16). Purified GBAP obtained by these methods can be used directly in the assays shown in
Examples XI and XV.
XI. Demonstration of GBAP Activity

GTP-binding activity of GBAP is determined in an assay that measures the binding of GBAP
to α-³²P-labeled GTP. Purified GBAP is first blotted onto filters and rinsed in a suitable buffer. The
filters are then incubated in buffer containing radiolabeled α-³²P-GTP. The filters are washed in buffer
to remove unbound GTP and counted in a radioisotope counter. Non-specific binding is determined in
an assay that contains a 100-fold excess of unlabeled GTP. The amount of specific binding is
proportional to the activity of GBAP.
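
Specific binding in an assay of this kind is conventionally obtained by subtracting the non-specific counts (measured with excess unlabeled GTP) from the total counts. The sketch below assumes replicate filters counted in counts per minute; the function name and the example values are hypothetical, and the subtraction itself is standard practice rather than a step spelled out in the text.

```python
def specific_binding_cpm(total_cpm, nonspecific_cpm) -> float:
    """Mean total counts minus mean non-specific counts, floored at zero.

    total_cpm:       replicate counts from filters incubated with labeled GTP
    nonspecific_cpm: replicate counts in the presence of excess unlabeled GTP
    """
    mean_total = sum(total_cpm) / len(total_cpm)
    mean_nonspecific = sum(nonspecific_cpm) / len(nonspecific_cpm)
    return max(mean_total - mean_nonspecific, 0.0)


if __name__ == "__main__":
    # Illustrative numbers only
    print(specific_binding_cpm([12000, 11800], [1500, 1700]))
```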

GTPase activity of GBAP is determined in an assay that measures the conversion of α-³²P-GTP
to α-³²P-GDP. GBAP is incubated with α-³²P-GTP in buffer for an appropriate period of time, and the
reaction is terminated by heating or acid precipitation followed by centrifugation. An aliquot of the
supernatant is subjected to polyacrylamide gel electrophoresis (PAGE) to separate GDP and GTP
together with unlabeled standards. The GDP spot is cut out and counted in a radioisotope counter. The
amount of radioactivity recovered in GDP is proportional to GTPase activity of GBAP.
XII. Functional Assays 

GBAP function is assessed by expressing the sequences encoding GBAP at physiologically
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which
contain the cytomegalovirus promoter. 5-10 μg of recombinant vector are transiently transfected into a
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome
formulations or electroporation. 1-2 μg of an additional plasmid containing sequences encoding a
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP;
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These
events include changes in nuclear DNA content as measured by staining of DNA with propidium
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side
light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity
with specific antibodies; and alterations in plasma membrane composition as measured by the binding
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are
discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY.
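The comparison of transfected and nontransfected cells can be sketched as a simple two-gate analysis. The sketch below assumes that each event carries a marker (GFP) intensity and an Annexin V intensity, and that fixed thresholds separate positive from negative cells; the thresholds, the event values, and the function name are hypothetical.

```python
def annexin_positive_fraction(events, gfp_gate, annexin_gate):
    """Return the Annexin V-positive fraction among transfected and
    nontransfected cells, as a (transfected, nontransfected) tuple.

    events: list of (gfp_intensity, annexin_intensity) pairs, one per cell.
    A population with no cells yields None for its fraction.
    """
    def fraction(cells):
        if not cells:
            return None
        positive = sum(1 for _, annexin in cells if annexin >= annexin_gate)
        return positive / len(cells)

    transfected = [e for e in events if e[0] >= gfp_gate]
    nontransfected = [e for e in events if e[0] < gfp_gate]
    return fraction(transfected), fraction(nontransfected)


# Hypothetical events, for illustration only.
events = [(900, 700), (850, 120), (40, 90), (30, 650), (1200, 800), (25, 60)]
print(annexin_positive_fraction(events, gfp_gate=500, annexin_gate=500))
```

A higher Annexin V-positive fraction in the marker-positive population than in the marker-negative population would be consistent with an effect of the expressed cDNA, subject to appropriate controls.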

The influence of GBAP on gene expression can be assessed using highly purified populations of
cells transfected with sequences encoding GBAP and either CD64 or CD64-GFP. CD64 and
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY).
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression
of mRNA encoding GBAP and other genes of interest can be analyzed by northern analysis or
microarray techniques.

XIII. Production of GBAP Specific Antibodies

GBAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize rabbits and to produce antibodies using standard protocols.

Alternatively, the GBAP amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.)
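Selection of a hydrophilic region can be illustrated with a sliding-window scan. The sketch below is not the LASERGENE analysis; it substitutes the Kyte-Doolittle hydropathy scale as a stand-in and picks the window with the lowest mean hydropathy (that is, the most hydrophilic stretch). The window length of 15, the function name, and the test sequence are assumptions made for illustration.

```python
# Kyte-Doolittle hydropathy values (one-letter amino acid codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(sequence, window=15):
    """Return (start_index, peptide, mean_hydropathy) for the window with
    the lowest mean hydropathy; start_index is 0-based."""
    if len(sequence) < window:
        raise ValueError("sequence is shorter than the window")
    best_start, best_score = 0, None
    for start in range(len(sequence) - window + 1):
        peptide = sequence[start:start + window]
        score = sum(KYTE_DOOLITTLE[aa] for aa in peptide) / window
        if best_score is None or score < best_score:
            best_start, best_score = start, score
    return best_start, sequence[best_start:best_start + window], best_score


# Hypothetical sequence: a lysine-rich stretch flanked by leucines.
start, peptide, score = most_hydrophilic_window("L" * 20 + "K" * 15 + "L" * 20)
print(start, peptide)
```

Hydrophilicity is only one of the criteria mentioned above; a practical selection would also weigh position in the protein, such as proximity to the C-terminus.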

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St.
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-GBAP
activity by, for example, binding the peptide or GBAP to a substrate, blocking with 1% BSA, reacting
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.

XIV. Purification of Naturally Occurring GBAP Using Specific Antibodies

Naturally occurring or recombinant GBAP is substantially purified by immunoaffinity
chromatography using antibodies specific for GBAP. An immunoaffinity column is constructed by
covalently coupling anti-GBAP antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is
blocked and washed according to the manufacturer's instructions.

Media containing GBAP are passed over the immunoaffinity column, and the column is washed
under conditions that allow the preferential absorbance of GBAP (e.g., high ionic strength buffers in the
presence of detergent). The column is eluted under conditions that disrupt antibody/GBAP binding
(e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion),
and GBAP is collected.

XV. Identification of Molecules Which Interact with GBAP 

GBAP, or biologically active fragments thereof, are labeled with ¹²⁵I Bolton-Hunter reagent.
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules
previously arrayed in the wells of a multi-well plate are incubated with the labeled GBAP, washed, and
any wells with labeled GBAP complex are assayed. Data obtained using different concentrations of
GBAP are used to calculate values for the number, affinity, and association of GBAP with the
candidate molecules.
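One common way to derive the number of binding sites and the affinity from binding data at several concentrations is a Scatchard analysis; the text does not name a specific method, so this choice is an assumption. The sketch below fits the linear relation B/F = (Bmax - B)/Kd by ordinary least squares, where B is bound and F is free labeled GBAP; the function name and the synthetic data are hypothetical.

```python
def scatchard_fit(bound, free):
    """Fit B/F = (Bmax - B) / Kd by least squares.

    bound, free: equal-length lists of bound and free ligand concentrations.
    Returns (Kd, Bmax): the slope of B/F against B is -1/Kd and the
    x-intercept is Bmax.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax


# Synthetic single-site data generated with Kd = 2 and Bmax = 10.
free = [0.5, 1.0, 2.0, 4.0, 8.0]
bound = [10.0 * f / (2.0 + f) for f in free]
kd, bmax = scatchard_fit(bound, free)
print(round(kd, 3), round(bmax, 3))
```

A linear Scatchard plot presumes a single class of non-interacting sites; curved plots would call for a different model.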

Alternatively, molecules interacting with GBAP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

GBAP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT),
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent
No. 6,057,101).

Various modifications and variations of the described methods and systems of the invention will
be apparent to those skilled in the art without departing from the scope and spirit of the invention.
Although the invention has been described in connection with certain embodiments, it should be
understood that the invention as claimed should not be unduly limited to such specific embodiments.
Indeed, various modifications of the described modes for carrying out the invention which are obvious
to those skilled in molecular biology or related fields are intended to be within the scope of the following
claims.
pINCY

pINCY

pINCY

PBLUESCRIPT

PSPORT1

pINCY


Disease or Condition
(Fraction of Total)

Cancer (0.429)
Inflammation/Trauma (0.524)
Cell Proliferation (0.095)

Cancer (0.444)
Cell Proliferation (0.315)
Inflammation/Trauma (0.278)

Cancer (0.714)
Inflammation/Trauma (0.142)

Cancer (0.467)
Cell Proliferation (0.244)
Inflammation/Trauma (0.267)

Cell Proliferation (0.400)
Inflammation/Trauma (0.429)
Cancer (0.314)

Cancer (0.667)
Cell Proliferation (0.143)
Inflammation/Trauma (0.238)

Cancer (0.422)
Inflammation/Trauma (0.349)
Cell Proliferation (0.205)

Cancer (0.405)
Cell Proliferation (0.270)
Inflammation/Trauma (0.324)

Cancer (0.509)
Inflammation/Trauma (0.269)
Cell Proliferation (0.157)

Cancer (0.524)
Inflammation/Trauma (0.310)
Cell Proliferation (0.143)

Cancer (0.355)
Inflammation/Trauma (0.342)
Cell Proliferation (0.211)

Cancer (0.562)
Inflammation/Trauma (0.250)


Tissue Expression
(Fraction of Total)

Cardiovascular (0.238)
Reproductive (0.238)
Hematopoietic/Immune (0.190)

Nervous (0.185)
Reproductive (0.167)
Gastrointestinal (0.148)

Reproductive (0.429)
Nervous (0.142)
Hematopoietic/Immune (0.142)

Reproductive (0.333)
Nervous (0.178)
Cardiovascular (0.111)

Hematopoietic/Immune (0.257)
Reproductive (0.229)
Gastrointestinal (0.143)

Gastrointestinal (0.286)
Reproductive (0.286)
Cardiovascular (0.238)

Reproductive (0.229)
Hematopoietic/Immune (0.157)
Nervous (0.157)

Reproductive (0.270)
Gastrointestinal (0.162)
Cardiovascular (0.135)

Reproductive (0.296)
Gastrointestinal (0.167)
Nervous (0.167)

Reproductive (0.238)
Cardiovascular (0.190)
Gastrointestinal (0.190)

Reproductive (0.250)
Nervous (0.224)
Hematopoietic/Immune (0.132)
Gastrointestinal (0.132)

Reproductive (0.375)
Nervous (0.188)
Urologic (0.188)


Selected 
Fragments 


434-478 


380-424 
551-595 


433-477 


684-728 


219-263 


865-912 


900-944 


109-153 
919-963 


1352-1396 
1568-1612 


541-585 
1189-1233 
PBLUESCRIPT

pINCY

pINCY


Disease or Condition
(Fraction of Total)

Inflammation/Trauma (0.636)
Cancer (0.364)

Cancer (0.431)
Inflammation/Trauma (0.451)
Cell Proliferation (0.196)

Cancer (0.466)
Inflammation/Trauma (0.288)
Cell Proliferation (0.151)

Cancer (0.444)
Cell Proliferation (0.370)
Inflammation/Trauma (0.333)

Inflammation/Trauma (0.429)
Cell Proliferation (0.357)
Cancer (0.286)

Cancer (0.556)
Cell Proliferation (0.111)

Cancer (0.308)
Cell Proliferation (0.231)
Inflammation/Trauma (0.154)

Inflammation (0.394)
Cancer (0.318)
Cell Proliferation (0.212)

Cancer (0.448)
Cell Proliferation (0.218)
Inflammation (0.207)

Cancer (0.681)
Cell Proliferation (0.182)
Inflammation/Trauma (0.318)

Cancer (0.404)
Inflammation (0.220)
Cell Proliferation (0.165)


Tissue Expression
(Fraction of Total)

Hematopoietic/Immune (0.227)
Nervous (0.227)
Reproductive (0.227)

Reproductive (0.275)
Gastrointestinal (0.196)
Nervous (0.196)

Reproductive (0.301)
Nervous (0.151)
Gastrointestinal (0.130)

Reproductive (0.333)
Developmental (0.148)
Gastrointestinal (0.148)

Hematopoietic/Immune (0.400)
Gastrointestinal (0.200)
Cardiovascular (0.100)

Cardiovascular (0.667)
Nervous (0.222)
Hematopoietic/Immune (0.111)

Reproductive (0.182)
Cardiovascular (0.091)

Hematopoietic/Immune (0.288)
Reproductive (0.197)
Cardiovascular (0.136)

Reproductive (0.218)
Nervous (0.172)
Gastrointestinal (0.138)

Reproductive (0.273)
Gastrointestinal (0.227)
Cardiovascular (0.136)
Musculoskeletal (0.136)

Reproductive (0.229)
Gastrointestinal (0.193)
Nervous (0.193)


Selected 
Fragments 


380-424 


217-261 


488-532 
812-856 
Disease or Condition
(Fraction of Total)

Cell Proliferation (0.400)
Inflammation (0.400)
Cancer (0.200)

Cancer (0.468)
Inflammation/Trauma (0.325)
Cell Proliferation (0.273)

Cancer (0.469)
Inflammation/Trauma (0.326)
Cell Proliferation (0.306)

Cancer (0.550)
Inflammation/Trauma (0.284)
Cell Proliferation (0.150)

Inflammation/Trauma (0.468)
Cancer (0.392)
Cell Proliferation (0.165)

Cancer (0.455)
Inflammation/Trauma (0.333)
Cell Proliferation (0.273)

Cancer (0.400)
Inflammation/Trauma (0.200)
Neurological (0.200)

Cancer (0.462)
Inflammation/Trauma (0.339)
Cell Proliferation (0.231)

Cancer (0.553)
Cell Proliferation (0.368)
Inflammation/Trauma (0.342)

Cancer (0.390)
Inflammation/Trauma (0.390)
Cell Proliferation (0.220)

Cancer (0.429)
Inflammation/Trauma (0.381)
Cell Proliferation (0.190)
Table (cont.)


Tissue Expression
(Fraction of Total)

Nervous (0.400)
Cardiovascular (0.200)
Developmental (0.200)
Gastrointestinal (0.200)

Reproductive (0.221)
Gastrointestinal (0.156)
Hematopoietic/Immune (0.143)

Reproductive (0.286)
Hematopoietic/Immune (0.143)
Nervous (0.143)

Reproductive (0.250)
Nervous (0.200)
Cardiovascular (0.183)

Reproductive (0.291)
Hematopoietic/Immune (0.228)
Nervous (0.152)

Reproductive (0.242)
Hematopoietic/Immune (0.121)
Nervous (0.121)
Urologic (0.121)

Nervous (0.600)
Reproductive (0.400)

Gastrointestinal (0.262)
Reproductive (0.215)
Nervous (0.169)

Reproductive (0.211)
Gastrointestinal (0.211)
Hematopoietic/Immune (0.158)

Reproductive (0.268)
Nervous (0.146)
Cardiovascular (0.146)

Gastrointestinal (0.238)
Cardiovascular (0.190)
Hematopoietic/Immune (0.143)
Nervous (0.143)
Endocrine (0.143)




Selected 
Fragments


98-142 


384-428 
2016-2060 


80-124 
731-775 


437-481 
641-685 


397-441 
1036-1080 


247-291 


453-497 
858-902 
1. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of:

a) an amino acid sequence selected from the group consisting of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, and SEQ ID NO:66,

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an
amino acid sequence selected from the group consisting of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, and SEQ ID NO:66,

c) a biologically active fragment of an amino acid sequence selected from the group
consisting of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, and SEQ ID NO:66, and

d) an immunogenic fragment of an amino acid sequence selected from the group consisting
of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, and SEQ ID NO:66.

2. An isolated polypeptide of claim 1 selected from the group consisting of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8,
SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20,
SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,
SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, and SEQ ID NO:66.

3. An isolated polynucleotide encoding a polypeptide of claim 1.

4. An isolated polynucleotide encoding a polypeptide of claim 2.

5. An isolated polynucleotide of claim 4 selected from the group consisting of
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:90, SEQ ID NO:91,
SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:118, SEQ ID NO:119,
SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
and SEQ ID NO:132.

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a
polynucleotide of claim 3.

7. A cell transformed with a recombinant polynucleotide of claim 6.

8. A transgenic organism comprising a recombinant polynucleotide of claim 6.

9. A method for producing a polypeptide of claim 1, the method comprising:

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim
1, and

b) recovering the polypeptide so expressed.

10. An isolated antibody which specifically binds to a polypeptide of claim 1.

11. An isolated polynucleotide comprising a polynucleotide sequence selected from the
group consisting of:

a) a polynucleotide sequence selected from the group consisting of
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:90, SEQ ID NO:91,
SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:118, SEQ ID NO:119,
SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
and SEQ ID NO:132,

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a
polynucleotide sequence selected from the group consisting of
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:90, SEQ ID NO:91,
SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:118, SEQ ID NO:119,
SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
and SEQ ID NO:132,

c) a polynucleotide sequence complementary to a),

d) a polynucleotide sequence complementary to b), and

e) an RNA equivalent of a)-d).

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 11, the method comprising:

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides
comprising a sequence complementary to said target polynucleotide in the sample, and which probe
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization
complex is formed between said probe and said target polynucleotide or fragments thereof, and

b) detecting the presence or absence of said hybridization complex, and, optionally, if
present, the amount thereof.

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides.

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide
having a sequence of a polynucleotide of claim 11, the method comprising:

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a
pharmaceutically acceptable excipient.

17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence
selected from the group consisting of
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:90, SEQ ID NO:91,
SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105,
SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:118, SEQ ID NO:119,
SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
and SEQ ID NO:132.



18. A method for treating a disease or condition associated with decreased expression of
functional GBAP, comprising administering to a patient in need of such treatment the pharmaceutical
composition of claim 16.

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of
claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting agonist activity in the sample.

20. A composition comprising an agonist compound identified by a method of claim 19 and
a pharmaceutically acceptable excipient.

21. A method for treating a disease or condition associated with decreased expression of
functional GBAP, comprising administering to a patient in need of such treatment a pharmaceutical

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide
of claim 1, the method comprising:

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and

b) detecting antagonist activity in the sample.

23. A composition comprising an antagonist compound identified by a method of claim 22
and a pharmaceutically acceptable excipient.

24. A method for treating a disease or condition associated with overexpression of functional
GBAP, comprising administering to a patient in need of such treatment a pharmaceutical composition
of claim 23.



25. A method of screening for a compound that specifically binds to the polypeptide of claim
1, said method comprising the steps of:

a) combining the polypeptide of claim 1 with at least one test compound under suitable
conditions, and

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a
compound that specifically binds to the polypeptide of claim 1.

26. A method of screening for a compound that modulates the activity of the polypeptide of
claim 1, said method comprising:

a) combining the polypeptide of claim 1 with at least one test compound under conditions
permissive for the activity of the polypeptide of claim 1,

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound,
and

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a
compound that modulates the activity of the polypeptide of claim 1.

27. A method for screening a compound for effectiveness in altering expression of a target
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method
comprising:

a) exposing a sample comprising the target polynucleotide to a compound, and 

28. A method for assessing toxicity of a test compound, said method comprising:

a) treating a biological sample containing nucleic acids with the test compound;

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at
least 20 contiguous nucleotides of a polynucleotide of claim 11 under conditions whereby a specific
hybridization complex is formed between said probe and a target polynucleotide in the biological
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim
11 or fragment thereof;

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
SEQUENCE LISTING

<110> INCYTE GENOMICS, INC.
YUE, Henry
TANG, Y. Tom
BANDMAN, Olga
HILLMAN, Jennifer L.
LAL, Preeti
AU-YOUNG, Janice
REDDY, Roopa
YANG, Junming
BAUGHN, Mariah R.
LU, Dyung Aina M.
AZIMZAI, Yalda
PATTERSON, Chandra

<120> GTP-BINDING ASSOCIATED PROTEINS

<130> PF-0714 PCT

<140> To Be Assigned
<141> Herewith

<150> 60/144,595; 60/150,460; 60/159,849
<151> 1999-07-19; 1999-08-23; 1999-10-15

<160> 132

<170> PERL Program

<210> 1
<211> 269
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 1405545CD1

<400> 1



Met Pro Ala Val Leu Glu Arg Leu Ser Arg Tyr Asn Ser Thr Ser
  1               5                  10                  15
Gln Ala Phe Ala Glu Val Leu Arg Leu Pro Lys Gln Gln Leu Arg
                 20                  25                  30
Lys Leu Leu Tyr Pro Leu Gln Glu Val Glu Arg Phe Leu Ala Pro
                 35                  40                  45
Tyr Gly Arg Gln Asp Leu His Leu Arg Ile Phe Asp Pro Ser Pro
                 50                  55                  60
Glu Asp Ile Ala Arg Ala Asp Asn Ile Phe Thr Ala Thr Glu Arg
                 65                  70                  75
Asn Arg Ile Asp Tyr Val Ser Ser Ala Val Arg Ile Asp His Ala
                 80                  85                  90
Pro Asp Leu Pro Arg Pro Glu Val Cys Phe Ile Gly Arg Ser Asn
                 95                 100                 105
Val Gly Lys Ser Ser Leu Ile Lys Ala Leu Phe Ser Leu Ala Pro
                110                 115                 120
Glu Val Glu Val Arg Val Ser Lys Lys Pro Gly His Thr Lys Lys
                125                 130                 135
Met Asn Phe Phe Lys Val Gly Lys His Phe Thr Val Val Asp Met
                140                 145                 150
Pro Gly Tyr Gly Phe Arg Ala Pro Glu Asp Phe Val Asp Met Val
                155                 160                 165
Glu Thr Tyr Leu Lys Glu Arg Arg Asn Leu Lys Arg Thr Phe Leu
                170








Leu Val 


Asp 


Ser 


Val 


Val 


Gly He 








185 








Tip filii 






Glu 


\S A n 


Phe 


Ala 








200 








Thr Lys 


He 


Asp 


Lys 


Ser 


Ser 


Lys 








215 










Tie 


Gin 


Lys 


Phe 


Val 


Asn 








-J w 








Pro Gin 


Leu 


Phe 


Pro 


Val 


Ser 


Ala 








245 








Leu Leu 


Arg 


Cys 


Phe 


He 


Ala 


Ser 








260 








<210> 2
<211> 428
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 1451265CD1

<400> 2














Met Glu 


Val 


Ala 


Val 


Cys 


Thr 


Asp 


1 






5 








Cys He 


Val 


Trp 


Glu 


Leu 


His 


Ser 








20 








Arg Gly 


Gly 


Gin 


Ala 


Gly 


Pro 


Arg 








35 








Glu Tyr 


Leu 


Leu 


Ala 


Ala 


Gin 


Leu 








50 








Trp Glu 


Leu 


Gin 


Arg 


Lys 


Asp 


Gin 








65 








Pro Gly 


Pro 


Val 


Thr 


Cys 


Leu 


Thr 








80 








Val Leu 


Ala 


Gly 


Val 


Ala 


Glu 


Ser 








95 








Thr Gly 


Asn 


Leu 


Leu 


Val 


He 


Leu 








110 








Ser Cys 


Leu 


Gin 


Phe 


Thr 


Gly 


Asp 








125 








Gly Lys 


Asp 


Cys 


Leu 


Val 


Leu 


Val 








140 








Gin Ala 


Asp 


Pro 


Ser 


Arg 


He 


Pro 








155 








His His 


Thr 


Leu 


Pro 


He 


Thr 


Asp 








170 








Pro Leu 


Ala 


Arg 


Val 


Ala 


Thr 


Ser 








185 








Leu Trp 


Glu 


Val 


Ser 


Ser 


Gly 


Glu 








200 








Asp Val 


Ser 


He 


Met 


Ala 


Val 


Thr 








215 








Met Phe 


Cys 


Gly 


Gly 


Ser 


Glu 


Gly 








230 








Phe Thr 


Trp 


Pro 


Gly 


Gin 


Arg 


Glu 








245 








Asp Ala 


Gly 


Lys 


Val 


Phe 


Lys 


Gly 








260 








Leu Ser 


Val 


Ser 


Thr 


Asp 


Gly 


Ser 








275 








Asp Glu 


Thr 


Val 


Arg 


Leu 


Trp 


Asp 








290 











175 






180 


Gin 


Lvs 


Thr Asp Asn 


He 


Ala 




190 






195 


Leu 


Pro 


Tyr Val He 


Val 


Leu 




205 






210 


Gly 


His 


Leu Leu Lys 


Gin 


Val 




220 






225 


Met 


Lys 


Thr Gin Gly Cys 


Phe 




235 






240 


Val 


Thr 


Phe Ser Gly 


He 


His 




250 






255 


Val 


Thr 


Gly Ser Leu 


Asp 
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Ser 


Ala 


Ala Pro 


Met Trp 


Ser 
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15 


Gly 


Ala 


Asn Leu 


Leu Thr 


Tyr 




25 






J U 


Gly 


Leu 


Ala Leu 


Leu Asn 


Gly 










A ^ 


Gly 


Lys 


Asn Tyr 


He Ser 
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oil 


Leu 


Gin 


Gin Lys 


He Met 
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/U 
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Pro Asn Gly Leu 


Tyr 
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90 


He 


His 


Leu Trp 


Glu Val 


Ser 
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Arg 


His Tyr 
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Val 
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Arg 


Thr 


Val Ala 


Leu 
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Lys 


Gly 


Pro 


Leu 


Ala 


Pro Val 


Ser 
320 


Met 


Leu 
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Pro 


Leu 


Pro His 


Phe 
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Lys 
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Glu 


Pro Arg 
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Gly 


Leu 


Gin 


Gin 
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Tyr 


Gin 
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Val Leu 
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Gin 
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Arg 
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<211> 334 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 5308471CD1 

<400> 8 



Mpf 


ax. y 


T .01 1 


Thr 

X ILL 


iX \J 


Aig 


AT a 


Leu Cys 




Ala Ala Gin Ala Ala 

Axa ax a vjxii nia rtxa 


1 

X 








c 










1 0 
X u 


15 






gi ii 


Asn 


Phe 


Pro 
XrX \J 


Leu 


Cys Gly 


Arcr 
ax y 


Asr» Val Ala Ara Tro 










?o 












10 


irxic 


Pro 


\j x y 


Hi q 
nxo 




Ala 


Lys 


Gly Leu 


Lys 


T.v<; Met Gin Ser Ser 










^S 










40 


45 


Leu 


Lys 


T «n 


val 


Asp 


Cys 


Tip 


lie 


Glu 


Val 
Val 


pic icn Ala Arri Tie 
alb rits y ftld rix y lie 




















«j ~> 


60 


Pro 


Leu 


Coy 


VjXjt 


Arg 


Asn 


Pro 


Leu 


Phe 


OXXi 


Gin Thr T.en Glv T.eii 
ox u x ixx ucu uxy xjt^ix 




















70 


7S 


Lys 


Pro 


nib 


Leu 


Leu 


Val 
val 


XjSU 


Asn 


Lys 




Icrt T.on Ala Aqt^ T^eu 
ASky iJCU AXa rio^j XjCLX 










on 
o u 












QO 


X XIX 




VjXIi 


gi n 


Lys 


Tl e 
x ic 


Mpf 


Gin 


His 


T .pi i 
XJC ll 


Glu Glv Gin Gl vr T.pii 
oxu o x y vjxu oiy ucu 










Q s 










i no 

xuu 


1 OS 

IvJ 


T,ve 


Asn. 


Val 


Tl e 

X J.X2 


Phe 


Thr 


Asn 


Cys 


Val 


Lys 


ten Glu Asn Val Lvs 










1 1 0 

X X V 










1 1 5 

X X J 


' 120 


VjXll 


Tip 
IXC 


Tl e 
X X c 


Zr X W 


Met 


Val 


Thr 


Glu 


Leu 


He 


Civ Arci Ser His Arcr 










X £t J 










1 30 

X -> u 


135 




His 
n x £> 


Axg 


Lys 


Glu 


Asn 


Leu 


Glu Tyr 




Tie Met Val Tie Glv 










1 Art 

X *± u 










X *i J 


ISO 

X J w 


Val 

v a A. 


Pro 


Acn 


vox 


Gl v 


Lys 


Cpr 


Ser 


Leu 


He 


Sen Car T,pn ArCT ArCT 
noil k^^x. ucu fix y «^x. y 










X J J 










160 


165 


Gl n 








T .vrc 


gi v 

oxy 




Ala 


Thr 


Arrr 


Val Glv Glv Glu Pro 
vox vj J- y vxy uiu t j. v 










170 
X / w 










175 


180 




Tl e 

X X c 


Thr 

X I IX 


Arrr 
Axy 


Ala 

Aid 


Val 

vox 


Mpf 


Ser 


Lys 


He 


Gin Val Ser Glu Ara 

win vax mci vjx u ftiy 










185 










190 


195 


Pro Leu Met Phe Leu Leu Asp Thr Pro Gly Val Leu Ala Pro Arg

200 205 210

Ile Glu Ser Val Glu Thr Gly Leu Lys Leu Ala Leu Cys Gly Thr

215 220 225

Val Leu Asp His Leu Val Gly Glu Glu Thr Met Ala Asp Tyr Leu

230 235 240

Leu Tyr Thr Leu Asn Lys His Gln Arg Phe Gly Tyr Val Gln His

245 250 255

Tyr Gly Leu Gly Ser Ala Cys Asp Asn Val Glu Arg Val Leu Lys

260 265 270

Ser Val Ala Val Lys Leu Gly Lys Thr Gln Lys Val Lys Val Leu

275 280 285

Thr Gly Thr Gly Asn Val Asn Val Ile Gln Pro Asn Tyr Pro Ala

290 295 300

Ala Ala Arg Asp Phe Leu Gln Thr Phe Arg Arg Gly Leu Leu Gly

305 310 315

Ser Val Met Leu Asp Leu Asp Val Leu Arg Gly His Pro Pro Ala

320 325 330

Glu Thr Leu Pro

















<210> 9 
<211> 341 
<212> PRT 



He Glu Asp Val Leu 
220 

Lys Arg Asp Ser Asp 
235 



Leu Ser Ser Val Arg 
225 

Ser Glu Gly Asp 
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<213> Homo sapiens 








<220>

<221> misc_feature

<223> Incyte ID No: 5324322CD1

<400> 9










Met Glu Arg Ala 


Val 


Pro 


Leu 


Ala Val 


1 


5 








Val Phe Gin Ala 


Leu 


Gin 


Arg 


Leu His 




20 








Ser Val Ser Pro 


Cvs 


Gly 


Lys 


Phe Leu 




35 








Glv Gin lie Ala 


lie 


Phe 


Ser 


Leu Ser 




50 








Al a Ta/Q f5l 11 f*l 11 


Ser 


Lys 


Lys 


Pro Val 




65 








len f2l v Pro Val 
nop VJJiy ri.\J v a x 


xyx 


Ser 


Met 


Val Ser 




80 








C Ala fil v A^Ti 
■sex Aia. uiy ao^j 


Glv 


Glu 


Val 


Lys Ala 




95 








T.011 T.VQ T.V^ f^l \7 


uyt> 


Lys 


Glu 


Leu Trp 




1 1 V-/ 










f2l u 

Vjl LI 


Val 


Pro 


Glu He 












Prn T.vg fSl 11 Acn 


Ser 


Leu 


He 


Leu Ala 




1 40 








Hi«; Thr Met Asd 


Leu 


Glu 


Thr 


Gly Thr 




1 55 

1 -J -J 








vjiy niu ii ix nop 


l^yx 


He 


His 


Cys Leu 




1 70 
1 / w 








Pva CI 1 ii T7a 1 T ei i 
riu Vjiu val wcu 


OCl 


Gly 


Gly 


Glu Asp 




1 R5 

lO J 








Aisp IJCU niy 1 ill 


Ala 


Lys 


Glu 


Val Gin 




?oo 

^ \J V 








HtC f^lll f^lll C*~\.T C 

nis oiu ijiu uy s 


OCX 


Arg 


Pro 


His Asn 




215 
^ i j 








LjfefU Aid 1 111 nbp 


Ser 


Asp 


Trp 


Met Val 












litrUl J-ilX XjtrLL 1 I JJ 


His 


Leu 


Arg 


Ser Ser 




245 








Pro Tl p Ara Ala 

tlx-* 11C A.1 y Aid 


Pro 


Gin 


Lys 


His Val 




960 

L* V V 








Tie* Car" Ala 

i J. C UwU b?WJ> 


Glv 
uiy 


Gin 


Gly 


Arg Cys 




275 








Ser Glv Glu Leu 


Lys 


Ala 


Gin 


Val Pro 




290 








Leu Ser Leu Ser 


Leu 


Asn 


Gin 


Gin Pro 




305 








Val Leu Thr Ala 


Ala 


Gly 


Asn 


Ser Cys 




320 








Asn Leu Gly Tyr 


Arg 


Ala 


Phe 


Ser Leu 




335 








<210> 10

<211> 513

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 067184CD1

<400> 10











PCT/US00/19698 



Pro Leu Gly Gin 


Thr 


Glu 


10 






15 


Met Thr 


He Phe 


Ser 


Gin 


25 






30 


Ala Ala Gly Asn 


Asn 


Tyr 


40 






45 


Ser Ala 


Leu Ser 


Ser 


Glu 


55 






60 


Val Thr 


Phe Gin 


Ala 


His 


70 






75 


Thr Asp 


Arg His 


Leu 


Leu 


85 






90 


Trp Leu 


Trp Ala 


Glu 


Met 


100 






105 


Arg Arg Gin Pro 


Pro 


Tvr 


115 






120 


Asn Ala 


Leu Leu 


Leu 


Val 


130 






135 


Gly Gly Asp Cys 


Gin 


Leu 


145 






150 


Phe Thr 


Arg Val 


Leu 


Arg 


160 






165 


Ala Leu Arg Glu 


Arg 


Ser 


175 






180 


Gly Ala Val Arg 


Leu 


Trp 


190 






195 


Thr He 


Glu Val 


Tyr 


Lys 


205 






210 


Gly Arg 


Trp He 


Gly Cys 


220 






225 


Cys Gly Gly Gly 


Pro 


Ala 


235 






240 


Thr Pro 


Thr Thr 


He 


Phe 


250 






255 


Thr Phe 


Tyr Gin 


Asp 


Leu 


265 






270 


Val Asn 


Gin Trp 


Gin 


Leu 


280 






285 


Gly Ser 


Ser Pro 


Gly Leu 


295 






300 


Ala Ala 


Pro Glu 


Cys 


Lys 


310 






315 


Arg Val 


Asp Val 


Phe 


Thr 


325 






330 


Ser Phe 








340 
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Met 


Ser 


He 


Glu 


lie 


Glu 


1 








5 




Gin 


Tyr 


Leu 


Lys 


Glu 
20 


Asn 


Gin 


Glu 


Glu 


Thr 


Thr 
35 


Val 


Ser 


Phe 


Val 


Ala 


Asp 
50 


He 


Gin 


Ala 


He 


Gin 


Ser 
65 


Leu 


Leu 


Tyr 


Glu 


Gin 


Val 
80 


Val 


Gly 


Ala 


Ala 


Arg 


Ser 
95 


Leu 


Leu 


Lys 


Gin 


Thr 


Gin 
110 


Pro 


Leu 


Ala 


Arg 


Ser 


Tyr 
125 


Phe 


Ser 


Ser 


Lys 


Glu 


Lys 
140 


Arg 


Gly 


Glu 


Val 


Ser 


Val 
155 


Val 


Gly 


Gin 


Ala 


Leu 


Lys 
170 


Trp 


Gly 


Met 


Thr 


He 


Asp 
185 


Leu 


Val 


Glu 


Glu 


Glu 


Lys 
200 


Phe 


Phe 


Gly 


Gin 


Lys 


Ser 
215 


His 


Gly 


Gin 


Tyr 


Leu 


Val 

230 


Thr 


Trp 


Asn 


Phe 


Thr 


Thr 
245 


Gly 


Ala 


Gin 


Asp 


Asn 


Phe 
260 


Met 


Cys 


Phe 


Ser 


Arg 


Asp 
275 


Thr 


Gly 


Lys 


He 


Lys 


Val 
290 


Trp 


Arg 


Phe 


Glu 


Arg 


Ala 
305 


His 


Ser 


Lys 


Asp 


Ser 


Ser 
320 


Gin 


He 


Arg 


He 


His 


Gly 
335 


Leu 


Arg 


Gly 


His 


Ser 


Ser 
350 


Phe 


Gly 


His 


Tyr 


lie 


lie 
365 


Ser 


Trp 


Asn 


Met 


Lys 


Thr 
380 


Thr 


Gly 


Ser 


Thr 


Ala 


Gly 
395 


Thr 


Leu 


Pro 


Lys 


Asn 


Pro 
410 


Glu 


Thr 


Val 


Val 


lie 


Met 
425 


Asn 


Ser 


Ser 


Gly 


Lys 


Arg 
440 


Glu 


Ser 


Pro 


Arg 


Gly 


Glu 
455 


Trp 


Leu 


Tyr 


Cys 


Phe 


Ser 


Thr 






Ser 


Ser 


Asp 


Val 
10 


lie 


Arg 


Leu 


lie 


Met 
15 


Ser 


Leu 


His 


Arg 
25 


Ala 


Leu 


Ala 


Thr 


Leu 
30 


Ser 


Leu 


Asn 


Thr 
40 


Val 


Asp 


Ser 


He 


Glu 
45 


Asn 


Ser 


Gly 


His 
55 


Trp 


Asp 


Thr 


Val 


Leu 
60 


Lys 


Leu 


Pro 


Asp 
70 


Lys 


Thr 


Leu 


He 


Asp 
75 


Leu 


Glu 


Leu 


He 


Glu 


Leu 


Arg 


Glu 


Leu 






85 










90 


Leu 


Arg 


Gin 


Thr 
100 


Asp 


Pro 


Met 


lie 


Met 
105 


Glu 


Arg 


Tyr 


lie 
115 


His 


Leu 


Glu 


Asn 


Leu 
120 


Asp 


Pro 


Arg 


Glu 
130 


Ala 


Tyr 


Pro 


Asp 


Gly 
135 


Arg 


Ala 


Ala 


lie 
145 


Ala 


Gin 


Ala 


Leu 


Ala 
150 


Pro 


Pro 


Ser 


Arg 
160 


Leu 


Met 


Ala 


Leu 


Leu 
165 


Gin 


Gin 


His 


Gin 
175 


Gly 


Leu 


Leu 


Pro 


Pro 
180 


Phe 


Arg 


Gly 


Lys 
190 


Ala 


Ala 


Val 


Lys 


Asp 
195 


Pro 


Thr 


Gin 


Leu 
205 


Ser 


Arg 


His 


He 


Lys 
210 


Val 


Glu 


Cys 


Ala 
220 


Arg 


Phe 


Ser 


Pro 


Asp 
225 


Gly 


Ser 


Val 


Asp 
235 


Gly 


Phe 


He 


Glu 


Val 
240 


Lys 


lie 


Arg 


Lys 
250 


Asp 


Leu 


Lys 


Tyr 


Gin 
255 


Met 


Met 


Asp 


Asp 
265 


Ala 


Val 


Leu 


Cys 


Met 
270 


Glu 


Met 


Leu 


Ala 

280 


Thr 


Gly 


Ala 


Gin 


Asp 
285 


Lys 


He 


Gin 


Ser 
295 


Gly 


Gin 


Cys 


Leu 


Arg 
300 


Ser 


Lys 


Gly 


Val 
310 


Thr 


Cys 


Leu 


Ser 


Phe 
315 


He 


Leu 


Ser 


Ala 
325 


Ser 


Phe 


Asp 


Gin 


Thr 

330 


Lys 


Ser 


Gly 


Lys 
340 


Thr 


Leu 


Lys 


Glu 


Phe 
345 


Val 


Asn 


Glu 


Ala 
355 


Thr 


Phe 


Thr 


Gin 


Asp 
360 


Ala 


Ser 


Ser 


Asp 
370 


Gly 


Thr 


Val 


Lys 


lie 
375 


Glu 


Cys 


Ser 


Asn 
385 


Thr 


Phe 


Lys 


Ser 


Leu 
390 


Asp 


He 


Thr 


Val 
400 


Asn 


Ser 


Val 


He 


Leu 
405 


His 


Phe 


Val 


Val 
415 


Cys 


Asn 


Arg 


Ser 


Asn 
420 


Met 


Gin 


Gly 


Gin 
430 


lie 


Val 


Arg 


Ser 


Phe 
435 


Gly 


Gly 


Asp 


Phe 
445 


Val 


Cys 


Cys 


Ala 


Leu 
450 


lie 


Tyr 


Cys 


Val 
460 


Gly 


Glu 


Asp 


Phe 


Val 
465 


Val 


Thr 


Gly 


Lys 


Leu 


Glu 


Arg 


Thr 


Leu 
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470 








475 




480 


Thr Val His Glu Lys Asp Val Ile Gly Ile Ala His His Pro His

485 490 495

Gln Asn Leu Ile Ala Thr Tyr Ser Glu Asp Gly Leu Leu Lys Leu

500 505 510

Trp Lys Pro


















<210> 11

<211> 186

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 722896CD1

<400> 11


















Met Ile Ala Leu Phe Asn Lys Leu Leu Asp Trp Phe Lys Ala Leu

1 5 10 15

Phe Trp Lys Glu Glu Met Glu Leu Thr Leu Val Gly Leu Gln Tyr

20 25 30

Ser Gly Lys Thr Thr Phe Val Asn Val Ile Ala Ser Gly Gln Phe

35 40 45

Asn Glu Asp Met Ile Pro Thr Val Gly Phe Asn Met Arg Lys Ile

50 55 60

Thr Lys Gly Asn Val Thr Ile Lys Leu Trp Asp Ile Gly Gly Gln

65 70 75

Pro Arg Phe Arg Ser Met Trp Glu Arg Tyr Cys Arg Gly Val Ser

80 85 90

Ala Ile Val Tyr Met Val Asp Ala Ala Asp Gln Glu Lys Ile Glu

95 100 105

Ala Ser Lys Asn Glu Leu His Asn Leu Leu Asp Lys Pro Gln Leu

110 115 120


Gin Gly 


lie 


Pro 


Val 


Leu 


Val 


Leu Gly Asn 




Pro 








125 








130 




135 


Gly Ala Leu Asp Glu Lys Glu Leu Ile Glu Lys Met Asn Leu Ser

140 145 150


Ala Ile Gln Asp Arg Glu Ile Cys Cys Tyr Ser Ile Ser Cys Lys

155 160 165


Glu Lys Asp Asn Ile Asp Ile Thr Leu Gln Trp Leu Ile Gln His

170 175 180

Ser Lys Ser Arg Arg Ser

185














<210> 12

<211> 204

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1571739CD1

<400> 12


















Met Asn Asp Val Lys Leu Ala Val Leu Gly Gly Glu Gly Thr Gly

1 5 10 15

Lys Ser Ala Leu Thr Val Arg Phe Leu Thr Lys Arg Phe Ile Gly

20 25 30

Glu Tyr Ala Ser Asn Phe Glu Ser Ile Tyr Lys Lys His Leu Cys

35 40 45

Leu Glu Arg Lys Gln Leu Asn Leu Glu Ile Tyr Asp Pro Cys Ser

50 55 60

Gln Thr Gln Lys Ala Lys Phe Ser Leu Thr Ser Glu Leu His Trp

65 70 75
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Ala Asp Gly 


Phe 


Val 


lie Val Tyr Asp 


He 


Ser Asp Arg Ser 


Ser 




80 






85 




90 


Phe Ala Phe Ala Lys Ala Leu Ile Tyr Arg Ile Arg Glu Pro Gln

95 100 105

Thr Ser His Cys Lys Arg Ala Val Glu Ser Ala Val Phe Leu Val

110 115 120

Gly Asn Lys Arg Asp Leu Cys His Val Arg Glu Val Gly Trp Glu

125 130 135

Glu Gly Gln Lys Leu Ala Leu Glu Asn Arg Cys Gln Phe Cys Glu

140 145 150

Leu Ser Ala Ala Glu Gln Ser Leu Glu Val Glu Met Met Phe Ile

155 160 165

Arg Ile Ile Lys Asp Ile Leu Ile Asn Phe Lys Leu Lys Glu Lys

170 175 180

Arg Arg Pro Ser Gly Ser Lys Ser Met Ala Lys Leu Ile Asn Asn

185 190 195

Val Phe Gly Lys Arg Arg Lys Ser Val

200












<210> 13

<211> 100

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1739479CD1

<400> 13
















Met Trp Asp Ser Lys Lys Ile Gly Leu Arg Gln His His Cys Arg

1 5 10 15

Lys Cys Gly Lys Ala Val Cys Gly Lys Cys Ser Ser Lys Arg Ser

20 25 30

Ser Ile Pro Leu Met Gly Phe Glu Phe Glu Val Arg Val Cys Asp

35 40 45

Ser Cys His Glu Ala Ile Thr Asp Glu Glu Arg Ala Pro Thr Ala

50 55 60

Thr Phe His Asp Ser Lys His Asn Ile Val His Val His Phe Asp

65 70 75

Ala Thr Arg Gly Trp Leu Leu Thr Ser Gly Thr Asp Lys Val Ile

80 85 90

Lys Leu Trp Asp Met Thr Pro Val Val Ser

95 100






<210> 14

<211> 795

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1999147CD1

<400> 14
















Met Thr Ser Gly Ala Thr Arg Tyr Arg Leu Ser Cys Ser Leu Arg

1 5 10 15

Gly His Glu Leu Asp Val Arg Gly Leu Val Cys Cys Ala Tyr Pro

20 25 30

Pro Gly Ala Phe Val Ser Val Ser Arg Asp Arg Thr Thr Arg Leu

35 40 45

Trp Ala Pro Asp Ser Pro Asn Arg Ser Phe Thr Glu Met His Cys

50 55 60

Met Ser Gly His Ser Asn Phe Val Ser Cys Val Cys Ile Ile Pro

65 70 75

Ser Ser Asp Ile Tyr Pro His Gly Leu Ile Ala Thr Gly Gly Asn
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Asp His Asn 
Tyr lie Leu 
Gly Lys Phe 
Lys Val Trp 
Thr Ala Ala 
Met Leu Thr 
Gly Arg Cys 
Gly Leu Ala 
Asp Ala Ser 
Val Tyr Tyr 
Pro Asn Cys 
Arg lie Trp 
Ala Gin Ser 
Val Val Gly 
Glu Asp Arg 
Glu Leu Ser 
Asp lie Asn 
Pro Gly Thr 
Lys Val Glu 
Lys lie Gly 
Ser Gly Lys 
Ser lie Asp 
Asn Thr Ser 
Lys Asn Asp 
lie lie Asp 
Ser Phe Ser 
Ser Ser Gly 
Gly Ala Gly 
Met Ala Gly 
Ala Ala Ser 
Val Thr Phe 





80 




lie 


Cvs 


He 




95 




Lvs 


Gly 


His 




110 




Gly 


Thr 


Leu 




125 




Leu 


Asn 


Asp 




140 




Val 


Trp 


Ala 




155 




Gly 


Ser 


Ala 




170 




Glu 


Arq 


Thr 




185 




lie 


Leu 


Ser 




200 




He 


Arg 


Arg 




215 




Gly 


His 


Thr 




230 




Aro 


Asp 


Phe 




245 




Lys 


His 


Glv 




260 




He 








?7S 




Ala 


Ser 


Asp 




290 




Thr 


Ala 


Ser 




305 




His 


Ala 


Thr 




320 




Ala 


Glu 


Gin 




335 




Arg 


Glu 


Glv 




350 




Ala 




Gin 




365 




Asp 


Val 


Val 




380 




Val 


Leu 


Tvr 
i J i 




395 




Val 


Asn 


Glu 




410 




Asp 


Asp 


Pro 




425 




Leu 


Asn 


Pro 




440 




Asn 


Thr 


Lys 




455 




Asp 


Pro 


Phe 




470 




Ser 


Ser 


Asn 




485 




Arcr 


Tvr 


Val 




500 




Val 


Asp 


Pro 




515 




Lys 


Thr 


Met 




530 




Asp 


Gin 


Ala 




545 





Phe 


Ser 


Leu 


Lys 


Asn 


Thr 


Leu 


Ser 


Gly 


Lys 


Cys 


Met 


Val 


Lys 


He 


Asp 


Lys 


Thr 


Phe 


Ser 


Gly 


Glu 


Thr 


Glu 


Trp 


Gin 


He 


Asn 


Tyr 


He 


Val 


Thr 


Thr 


Glu 


Cys 


Ala 


Cys 


Cys 


Val 


Gly 


He 


He 


Ala 


Glu 


Glu 


He 


Asp 


Ser 


Leu 


Pro 


Gly 


Gin 


Thr 


Arg 


Trp 


Ser 


val 


Gly 


Ser 


Ser 


Glu 


Gly 


Lys 


Gly 


Gly 


Pro 


Trp 


Leu 


Thr 


Met 


Phe 


Leu 


Gly 


Gin 


Met 


Thr 


Gly 


Gly 


Thr 


Leu 


Pro 


Pro 


Gly 


Ser 


Phe 


Thr 


Gly 


Asn 


He 


Tyr 


Asn 


Pro 


Thr 



85 






Asp 


Ser 


Pro 


100 






Val 


Cvs 


Ser 


115 






Ser 


Trp 

* *■ tr 


Asp 


130 






Met 


Thr 


Leu 


145 






Leu 


Pro 


Glu 


160 






Val 


Lys 


Leu 


175 






His 


Glu 


Asp 


190 






Phe 


Leu 


Ser 


205 






Thr 


Gly 


Glu 


220 






Tvr 


Ser 


He 


235 






Ala 


Glu 


Asp 


250 






Gin 


Thr 


He 


265 






Leu 


Asp 


Asn 


280 






Arg 


Val 


Phe 


295 






He 


Lvs 


Ala 


310 






Lvs 


Thr 


Gly 


325 






Arg 


Glu 


His 


340 






Leu 


He 


Ara 


355 






Ser 


Glu 


Glv 


370 






Glv 


Ala 


Asn 


385 






Glu 


Phe 


Asp 


400 






Ser 


Tvr 


Lvs 


415 






Ala 


Tyr 


Asn 


430 






Asp 


Gin 


Val 


445 






Leu 


Gly 


Leu 


460 






Gly 


Arg 


Tyr 


475 






Thr 


Ala 


Asp 


490 






Ala 


Ser 


Met 


505 






Asn 


Ser 


Ala 


520 






Phe 


Pro 


Lys 


535 






Gin 


He 


Leu 


550 











90 


Met 


Pro 


Leu 






105 


Leu 


Ser 


Ser 






120 


Thr 


Thr 


Ala 






135 


Gin 


Gly 


His 






150 


Gin 


Gly 


Leu 






165 


Trp 


Lys 


Ala 






180 


Cys 


Val 


Arg 






195 


Cys 


Ala 


Asn 






210 


Cys 


Leu 


Glu 






225 


Ser 


Val 


Phe 






240 


Arg 


Ser 


Leu 






255 


Arq 


Leu 


Pro 






270 


Gly 


Asp 


He 






285 


Thr 


Glu 


Ser 






300 


Phe 


Glu 


Lys 






315 


Asp 


Leu 


Gly 






330 


Leu 


Asn 


Glu 






345 


Asp 


Gly 


Glu 






360 


Arq 


Trp 


He 






375 


Gin 


Gin 


Thr 






390 


Tvr 


Val 


Phe 






405 


Leu 


Pro 


Tyr 






420 


Phe 


Leu 


Gin 






435 


Ala 


Lvs 


Phe 






450 


Gly 


Asn 


Pro 






465 


Val 


Pro 


Gly 






480 


Pro 


Phe 


Thr 






495 


Gly 


Thr 


Thr 






510 


Tyr 


Arg 


Ser 






525 


Lys 


Glu 


Ala 






540 


Gly 


Lys 


Leu 






555 
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Lys Glu 


Leu 


Asn 


Gly 


Thr 


Ala 


Pro 


Glu 


Glu 


Lys 


Lys 


Leu 


Thr 


Glu 








560 










565 










570 


Asp Asp Leu Ile Leu Leu Glu Lys Ile Leu Ser Leu Ile Cys Asn

575 580 585

Ser Ser Ser Glu Lys Pro Thr Val Gln Gln Leu Gln Ile Leu Trp

590 595 600

Lys Ala Ile Asn Cys Pro Glu Asp Ile Val Phe Pro Ala Leu Asp

605 610 615

Ile Leu Arg Leu Ser Ile Lys His Pro Ser Val Asn Glu Asn Phe

620 625 630

Cys Asn Glu Lys Glu Gly Ala Gln Phe Ser Ser His Leu Ile Asn

635 640 645

Leu Leu Asn Pro Lys Gly Lys Pro Ala Asn Gln Leu Leu Ala Leu

650 655 660

Arg Thr Phe Cys Asn Cys Phe Val Gly Gln Ala Gly Gln Lys Leu

665 670 675

Met Met Ser Gln Arg Glu Ser Leu Met Ser His Ala Ile Glu Leu

680 685 690

Lys Ser Gly Ser Asn Lys Asn Ile His Ile Ala Leu Ala Thr Leu

695 700 705

Ala Leu Asn Tyr Ser Val Cys Phe His Lys Asp His Asn Ile Glu

710 715 720

Gly Lys Ala Gln Cys Leu Ser Leu Ile Ser Thr Ile Leu Glu Val

725 730 735

Val Gln Asp Leu Glu Ala Thr Phe Arg Leu Leu Val Ala Leu Gly

740 745 750

Thr Leu Ile Ser Asp Asp Ser Asn Ala Val Gln Leu Ala Lys Ser

755 760 765

Leu Gly Val Asp Ser Gln Ile Lys Lys Tyr Ser Ser Val Ser Glu

770 775 780

Pro Ala Lys Val Ser Glu Cys Cys Arg Phe Ile Leu Asn Leu Leu

785 790 795

<210> 15 

<211> 393 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2182085CD1 

<400> 15 



Met Glu Asp Phe Glu Asp Asp Pro Arg Ala Leu Gly Ala Arg Gly

1 5 10 15

His Arg Arg Ser Val Ser Arg Gly Ser Tyr Gln Leu Gln Ala Gln

20 25 30

Met Asn Arg Ala Val Tyr Glu Asp Arg Pro Pro Gly Ser Val Val

35 40 45

Pro Thr Ser Ala Ala Glu Ala Ser Arg Ala Met Ala Gly Asp Thr

50 55 60

Ser Leu Ser Glu Asn Tyr Ala Phe Ala Gly Met Tyr His Val Phe

65 70 75

Asp Gln His Val Asp Glu Ala Val Pro Arg Val Arg Phe Ala Asn

80 85 90

Asp Asp Arg His Arg Leu Ala Cys Cys Ser Leu Asp Gly Ser Ile

95 100 105

Ser Leu Cys Gln Leu Val Pro Ala Pro Pro Thr Val Leu Arg Val

110 115 120

Leu Arg Gly His Thr Arg Gly Val Ser Asp Phe Ala Trp Ser Leu

125 130 135

Ser Asn Asp Ile Leu Val Ser Thr Ser Leu Asp Ala Thr Met Arg

140 145 150
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He 


Trp Ala 


Ser 


Glu 


Asp 


Gly 


Arg 


Cys 


He 


Arg Glu He Pro 


Asp 










155 










160 




165 


Pro Asp Ser Ala Glu Leu Leu Cys Cys Thr Phe Gln Pro Val Asn

170 175 180

Asn Asn Leu Thr Val Val Gly Asn Ala Lys His Asn Val His Val

185 190 195

Met Asn Ile Ser Thr Gly Lys Lys Val Lys Gly Gly Ser Ser Lys

200 205 210

Leu Thr Gly Arg Val Leu Ala Leu Ser Phe Asp Ala Pro Gly Arg

215 220 225

Leu Leu Trp Ala Gly Asp Asp Arg Gly Ser Val Phe Ser Phe Leu

230 235 240


Phe Asp Met Ala Thr Gly Lys Leu Thr Lys Ala Lys Arg Leu Val

245 250 255


Val His Glu Gly Ser Pro Val Thr Ser Ile Ser Ala Arg Ser Trp

260 265 270

Val Ser Arg Glu Ala Arg Asp Pro Ser Leu Leu Ile Asn Ala Cys

275 280 285

Leu Asn Lys Leu Leu Leu Tyr Arg Val Val Asp Asn Glu Gly Thr

290 295 300

Leu Gln Leu Lys Arg Ser Phe Pro Ile Glu Gln Ser Ser His Pro

305 310 315

Val Arg Ser Ile Phe Cys Pro Leu Met Ser Phe Arg Gln Gly Ala

320 325 330

Cys Val Val Thr Gly Ser Glu Asp Met Cys Val His Phe Phe Asp

335 340 345

Val Glu Arg Ala Ala Lys Ala Ala Val Asn Lys Leu Gln Gly His

350 355 360

Ser Ala Pro Val Leu Asp Val Ser Phe Asn Cys Asp Glu Ser Leu

365 370 375

Leu Ala Ser Ser Asp Ala Ser Gly Met Val Ile Val Trp Arg Arg

380 385 390

Glu Gln Lys

<210> 16 

<211> 485 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2216640CD1 

<400> 16 



Met Ala Ala Ala Val Ala Asp Glu Ala Val Ala Arg Asp Val Gln

1 5 10 15

Arg Leu Leu Val Gln Phe Gln Asp Glu Gly Gly Gln Leu Leu Gly

20 25 30

Ser Pro Phe Asp Val Pro Val Asp Ile Thr Pro Asp Arg Leu Gln

35 40 45

Leu Val Cys Asn Ala Leu Leu Ala Gln Glu Asp Pro Leu Pro Leu

50 55 60

Ala Phe Phe Val His Asp Ala Glu Ile Val Ser Ser Leu Gly Lys

65 70 75

Thr Leu Glu Ser Gln Ala Val Glu Thr Glu Lys Val Leu Asp Ile

80 85 90

Ile Tyr Gln Pro Gln Ala Ile Phe Arg Val Arg Ala Val Thr Arg

95 100 105

Cys Thr Ser Ser Leu Glu Gly His Ser Glu Ala Val Ile Ser Val

110 115 120

Ala Phe Ser Pro Thr Gly Lys Tyr Leu Ala Ser Gly Ser Gly Asp

125 130 135

Thr Thr Val Arg Phe Trp Asp Leu Ser Thr Glu Thr Pro His Phe
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140 145 150 

Thr Cys Lys Gly His Arg His Trp Val Leu Ser Ile Ser Trp Ser

155 160 165

Pro Asp Gly Lys Lys Leu Ala Ser Gly Cys Lys Asn Gly Gln Ile

170 175 180

Leu Leu Trp Asp Pro Ser Thr Gly Lys Gln Val Gly Arg Thr Leu

185 190 195

Ala Gly His Ser Lys Trp Ile Thr Gly Leu Ser Trp Glu Pro Leu

200 205 210

His Ala Asn Pro Glu Cys Arg Tyr Val Ala Ser Ser Ser Lys Asp

215 220 225

Gly Ser Val Arg Ile Trp Asp Thr Thr Ala Gly Arg Cys Glu Arg

230 235 240

Ile Leu Thr Gly His Thr Gln Ser Val Thr Cys Leu Arg Trp Gly

245 250 255

Gly Asp Gly Leu Leu Tyr Ser Ala Ser Gln Asp Arg Thr Ile Lys

260 265 270

Val Trp Arg Ala His Asp Gly Val Leu Cys Arg Thr Leu Gln Gly

275 280 285

His Gly His Trp Val Asn Thr Met Ala Leu Ser Thr Asp Tyr Ala

290 295 300

Leu Arg Thr Gly Ala Phe Glu Pro Ala Glu Ala Ser Val Asn Pro

305 310 315

Gln Asp Leu Gln Gly Ser Leu Gln Glu Leu Lys Glu Arg Ala Leu

320 325 330

Ser Arg Tyr Asn Leu Val Arg Gly Gln Gly Pro Glu Arg Leu Val

335 340 345

Ser Gly Ser Asp Asp Phe Thr Leu Phe Leu Trp Ser Pro Ala Glu

350 355 360

Asp Lys Lys Pro Leu Thr Arg Met Thr Gly His Gln Ala Leu Ile

365 370 375

Asn Gln Val Leu Phe Ser Pro Asp Ser Arg Ile Val Ala Ser Ala

380 385 390

Ser Phe Asp Lys Ser Ile Lys Leu Trp Asp Gly Arg Thr Gly Lys

395 400 405

Tyr Leu Ala Ser Leu Arg Gly His Val Ala Ala Val Tyr Gln Ile

410 415 420

Ala Trp Ser Ala Asp Ser Arg Leu Leu Val Ser Gly Ser Ser Asp

425 430 435

Ser Thr Leu Lys Val Trp Asp Val Lys Ala Gln Lys Leu Ala Met

440 445 450

Asp Leu Pro Gly His Ala Asp Glu Val Tyr Ala Val Asp Trp Ser

455 460 465

Pro Asp Gly Gln Arg Val Ala Ser Gly Gly Lys Asp Lys Cys Leu

470 475 480

Arg Ile Trp Arg Arg

485 

<210> 17 

<211> 199 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2417361CD1 



<400> 17 

Met Asn Pro Arg Lys Lys Val Asp Leu Lys Leu Ile Ile Val Gly
1 5 10 15

Ala Ile Gly Val Gly Lys Thr Ser Leu Leu His Gln Tyr Val His

20 25 30

Lys Thr Phe Tyr Glu Glu Tyr Gln Thr Thr Leu Gly Ala Ser Ile

35 40 45 

Leu Ser Lys Ile Ile Ile Leu Gly Asp Thr Thr Leu Lys Leu Gln

50 55 60

Ile Trp Asp Thr Gly Gly Gln Glu Arg Phe Arg Ser Met Val Ser

65 70 75

Thr Phe Tyr Lys Gly Ser Asp Gly Cys Ile Leu Ala Phe Asp Val

80 85 90

Thr Asp Leu Glu Ser Phe Glu Ala Leu Asp Ile Trp Arg Gly Asp

95 100 105

Val Leu Ala Lys Ile Val Pro Met Glu Gln Ser Tyr Pro Met Val

110 115 120

Leu Leu Gly Asn Lys Ile Asp Leu Ala Asp Arg Lys Val Pro Gln

125 130 135

Glu Val Ala Gln Gly Trp Cys Arg Glu Lys Asp Ile Pro Tyr Phe

140 145 150

Glu Val Ser Ala Lys Asn Asp Ile Asn Val Val Gln Ala Phe Glu

155 160 165

Met Leu Ala Ser Arg Ala Leu Ser Arg Tyr Gln Ser Ile Leu Glu

170 175 180

Asn His Leu Thr Glu Ser Ile Lys Leu Ser Pro Asp Gln Ser Arg

185 190 195

Ser Arg Cys Cys


















<210> 18

<211> 163

<212> PRT

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 2454384CD1



<400> 18




















Met Glu Gly Pro Ser Leu Arg Gly Pro Ala Leu Arg Leu Ala Gly

1 5 10 15

Leu Pro Thr Gln Gln Asp Cys Asn Ile Gln Glu Lys Ile Asp Leu

20 25 30

Glu Ile Arg Met Arg Glu Gly Ile Trp Lys Leu Leu Ser Leu Ser

35 40 45

Thr Gln Lys Asp Gln Val Leu His Ala Val Lys Asn Leu Met Val

50 55 60

Cys Asn Ala Arg Leu Met Ala Tyr Thr Ser Glu Leu Gln Lys Leu

65 70 75

Glu Glu Gln Ile Ala Asn Gln Thr Gly Arg Cys Asp Val Lys Phe

80 85 90

Glu Ser Lys Glu Arg Thr Ala Cys Lys Gly Lys Ile Ala Ile Ser

95 100 105

Asp Ile Arg Ile Pro Leu Met Trp Lys Asp Ser Asp His Phe Ser

110 115 120

Asn Lys Glu Arg Ser Arg Arg Tyr Ala Ile Phe Cys Leu Phe Lys

125 130 135

Met Gly Ala Asn Val Phe Asp Thr Asp Val Val Asn Val Asp Lys

140 145 150

Thr Ile Thr Asp Ile Cys Phe Glu Asn Val Thr Ile Leu

155 160










<210> 19

<211> 290

<212> PRT

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 2610262CD1
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f Ala Ala ("Sin 


He 




XT i. \J 


Lys 


Pro 


1 


-> 










T en i T at i Pin & ttt 

Leu Leu bin Aiy 


l v lc U 


blU 


pi \r 
oiy 


Ser 


Gin 




Z V 










Vdl lie Val fiO 


Lys 


P.1 ii 
ulU 


pi ii 

ulU 


Gly Val 














A rrr T'Viy" Val Srrr 
AXy 1 ilX Vdl AXy 


VQl 


Trp 


T oi i 
JjcU 


Lys Arg 














r'xO del val J-jr^ 


m a 


Ala 
Ala 




Pro 


Ser 




Q -J 










irlie ASil riO blU 


rpT- — - 

i nr 


Arg 


Arg 


Leu 


Ser 




ou 










x nr lie ber biu 


rue 


lie 


Leu 


Ser 


Glu 














Ir iO Val LiyS ASn 


Tyr 


vjin 


AT a 
Ala 


His 


Gin 




11 u 










Jjcu rile Val JjcU 


pi 


Leu 


pi 1 1 
Val LI 


Trp 


Val 




1Z _> 










J_»yS Gin rTle Ala 


Trp 


fllS 


Cys 


Ser 


Glu 




14U 










oiy lyr Aiy 1 nx 


Car 


AT a 
Ala 


Val 
Val 


Ala 


Ser 




ICC 










Glu Tfir Arg his 


vai 


Pne 


Tl Q 

lie 


Gly Asp 




1 / u 










lie Lieu uys j_*eu 


p.1 ii 

ulu 


Pin 

uin 


P.1 ii 


Asn Cys 




1 QC 

lOD 










Arg Gly His Thr 


Giy 


Giy 


Val 


Thr 


Ala 




z UU 










Gin Arg Val Lieu 


rue 


ber 


Pi i.r 

vjiy 


Ser 


Ser 




Z ID 










Trp Asp He G ly 


pi.. 

Giy 


Arg 


Lys 


Gly 


Thr 




ft 

z o U 










His Asn Asp Arg 


vai 


pi — 
G±n 


«i. 
Aia 


Leu 


Ser 














Gin Leu He Ser 


Cys 


Gly 


Gly 


Asp 


Gly 




Z o U 










Met ASp Val G1U 


Arg 


pi _ 
Gin 


Pi ii 
G1U 


Pro 


Leu 




275 










Met He Ser Ala 


Val 












290 










<210> 20

<211> 705

<212> PRT

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 2700075CD1



<400> 20












Met Gly Thr Trp 


Glu 


His 


Leu 


Val 


Ser 


1 


5 










Arg Glu Arg Glu 


Val 


Lys 


Leu 


Trp Asp 




20 










Ala Leu Ala Ser 


Leu 


Thr 


Leu 


Asp 


Thr 




35 










Pro Leu Leu Asp 


Pro 


Asp 
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Ser 


Arg 


Ser 


Thr 


Asp 


Arg Val 


Leu 


Gin Trp 


Glu 


Arg 


lie 










125 








130 








135 


Asp 


Cys 


Gin 


Gly 


lie 


Pro 


Pro 


Ser Ser 


Lys 


Asp Lys 


Leu 


Gly 


Val 
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140 










145 






150 




Val 


TVr 
lyx 


Lvs Asn 


Lys 


Leu 


He 


Phe 


Phe 


Gly Gly 


Tyr Gly Tyr 








155 
i -> -» 










160 






165 


Leu 


Pro 


Glu 


Asn Lvs 


Val 


Leu 


Gly Thr 


Phe 


Glu 


Phe 


Asp. Glu Thr 








170 










175 






180 


Ser 


Phe 


Tm 
lip 


Asn Ser 


Ser 


His 


Pro Arg Gly 


Trp Asn 


Asp His Val 






185 










190 






195 


His 


lie 


Leu 


Asp Thr 


Glu 


Thr 


Phe 


Thr 


Trp 


Ser Gin 


Pro He Thr 








200 










205 






210 


Thr 




Lys 


Ala Pro 


Ser 


Pro 


Arg Ala Ala 


His 


Ala 


Cys Ala Thr 




215 










220 






225 


vol 


m v 


Asn 


1T.X y Ol y 


Phe 


Val 


Phe 


Gly Gly 


Arg Tyr 


Arg Asp Ala 








230 










0 3 5 
Z J J 






240 


Arg 


ric u 


Asn 


no jucu 


His 


Tvr 
lyx 


Leu 


Asn 


Leu 


Asp 


Thr 


Trp Glu Trp 








£t *± J 










i c n 






255 


Asn 


Glu. 


Leu 


He Pro 


Gin 


Glv 


He 


Cys 


Pro 


Val Gly 


Arg Ser Trp 


















o c c 
^ b b 






270 


His 


Ser 


Leu 


Thr Pro 


val 


Ser 


Ser Asp 


His 


Leu 


Phe 


Leu Phe Gly 


















280 






285 


\j xy 




Thr 


11 IX 


Lys 


Gin 


Pro 


Leu 


Ser 


Asp Ala 


Trp Thr Tyr 
















295 






300 


cys 


He 


Cor 




UlU 


ixp 


He 


Gin 


Phe 


Asn 


His 


Pro Tvr Thr 






J> U J 










310 






315 


VjXU 


T 


STL \J 


AX y LcU 


irp 


Hi Q 
nis 


Thr 


Ala 


Cys 


Ala 


Ser 


Aqn Gl u Glv 






_> ^ u 










325 






330 


Glu 


Val 


J. 1c 


val IT lie 


uiy 


v?xy 


Cys 


Ala 


Asn 


Asn 


Leu 


Leu Val His 








J J J 










340 






345 


nlS 


Arg 


Ala 


nld nib 


Car 
OCX 


Asn 


Glu 


He 


Leu 


He 


Phe 


Ser Val Gin 
















355 






360 


Pro 


Lys 


Ser 


Leu Val 


Arg 


Leu 


Ser 


Leu 


Glu 


Ala 


Val 


He Cys Phe 








365 










370 






375 


Lys 


Glu 


Met 


Leu Ala 


Asn 


Ser 


Trp 


Asn 


Cys 


Leu 


Pro 


Lys His Leu 






380 










385 






390 


Leu 


His 


Ser 


Val Asn 


Gin 


Arg 


Phe Gly 


Ser 


Asn 


Asn 


Thr Ser Gly 








395 










400 






405 



Ser 

Met Ala Glu Glu Met Glu Ser Ser Leu Glu Ala Ser Phe Ser Ser

1 5 10 15

Ser Gly Ala Val Ser Gly Ala Ser Gly Phe Leu Pro Pro Ala Arg

20 25 30

Ser Arg Ile Phe Lys Ile Ile Val Ile Gly Asp Ser Asn Val Gly

35 40 45

Lys Thr Cys Leu Thr Tyr Arg Phe Cys Ala Gly Arg Phe Pro Asp

50 55 60

Arg Thr Glu Ala Thr Ile Gly Val Asp Phe Arg Glu Arg Ala Val

65 70 75

Glu Ile Asp Gly Glu Arg Ile Lys Ile Gln Leu Trp Asp Thr Ala

80 85 90

Gly Gln Glu Arg Phe Arg Lys Ser Met Val Gln His Tyr Tyr Arg

95 100 105

Asn Val His Ala Val Val Phe Val Tyr Asp Met Thr Asn Met Ala

110 115 120

Ser Phe His Ser Leu Pro Ser Trp Ile Glu Glu Cys Lys Gln His

125 130 135

Leu Leu Ala Asn Asp Ile Pro Arg Ile Leu Val Gly Asn Lys Cys

140 145 150

Asp Leu Arg Ser Ala Ile Gln Val Pro Thr Asp Leu Ala Gln Lys

155 160 165

Phe Ala Asp Thr His Ser Met Pro Leu Phe Glu Thr Ser Ala Lys

170 175 180

Asn Pro Asn Asp Asn Asp His Val Glu Ala Ile Phe Met Thr Leu

185 190 195

Ala His Lys Leu Lys Cys His Lys Pro Leu Met Leu Ser Gln Pro

200 205 210

Pro Asp Asn Gly Ile Ile Leu Lys Pro Glu Pro Lys Pro Ala Met

215 220 225

Thr Cys Trp Cys
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Met 


o 


Ser 




Lys 


Cca-r- 


Ala 


Arg 


Tyr 


Asn 


Irn 'DV\ a Coy (~Z ~) \r 

ai y xriic ocx uiy 


ri~\ vr 


i 








c 
D 










10 






Pro 


Ser 


Asn 


Leu 


Pro 


TVit- 
1 1 IX 






Val 


Thr 


i hi. vj x_y i in. ni y 


Mf=»t" 










z u 










25 




7. 0 
J U 


Glu Thr Thr Phe Gly Pro Ala Phe Ser Ala Val Thr Thr Ile Thr

35 40 45

Lys Ala Asp Gly Thr Ser Thr Tyr Lys Gln His Cys Arg Thr Pro

50 55 60

Ser Ser Ser Ser Thr Leu Ala Tyr Ser Pro Arg Asp Glu Glu Asp

65 70 75

Ser Met Pro Pro Ile Ser Thr Pro Arg Arg Ser Asp Ser Ala Ile

80 85 90

Ser Val Arg Ser Leu His Ser Glu Ser Ser Met Ser Leu Arg Ser

95 100 105

Thr Phe Ser Leu Pro Glu Glu Glu Glu Glu Pro Glu Pro Leu Val

110 115 120

Phe Ala Glu Gln Pro Ser Val Lys Leu Cys Cys Gln Leu Cys Cys

125 130 135

Ser Val Phe Lys Asp Pro Val Ile Thr Thr Cys Gly His Thr Phe

140 145 150

Cys Arg Arg Cys Ala Leu Lys Ser Glu Lys Cys Pro Val Asp Asn

155 160 165

Val Lys Leu Thr Val Val Val Asn Asn Ile Ala Val Ala Glu Gln

170 175 180

Ile Gly Glu Leu Phe Ile His Cys Arg His Gly Cys Arg Val Ala

185 190 195

Gly Ser Gly Lys Pro Pro Ile Phe Glu Val Asp Pro Arg Gly Cys

200 205 210

Pro Phe Thr Ile Lys Leu Ser Ala Arg Lys Asp His Glu Gly Ser

215 220 225

Cys Asp Tyr Arg Pro Val Arg Cys Pro Asn Asn Pro Ser Cys Pro

230 235 240

Pro Leu Leu Arg Met Asn Leu Glu Ala His Leu Lys Glu Cys Glu

245 250 255

His Ile Lys Cys Pro His Ser Lys Tyr Gly Cys Thr Phe Ile Gly

260 265 270


Asn 


Gin 


Asp 


Thr 


Tyr 


Glu 


Thr 


His 


Leu 


Glu 


Thr Cys Arg Phe 


Glu 
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275 



Glv 


Leu Lvs Glu 


Phe 
290 


Leu 


Gin 


Gin 


Met 


His Val Ala 


Leu 
305 


Ala 


Gin 


Lvs 


Arg 


Ser Met Leu 


Glv 
320 


Lys 


Leu 


Ser 


xjy o 


Cpy Tien C4l u 


Leu 
335 


Lys 


Phe 


Asp 


T,ve 


T.on Cor ("21 11 

IJCU OCX ul u 


Asp 
350 


Leu 


Met 


Glu 


Leu 


a on A«3T"> Rill 


Leu 
365 


Ser 


His 


He 


lie 


"Leu Glv Ser 

J-iCU UlJ OCJ. 


Tvr* 
xyx 

380 


Asp 


Pro 


Gin 


Thr 


Phe Val Glv 


His 
395 


Gin 


Glv 


Pro 


Ser 


Met Glv Asd 

J-«l^Lf \J^Y 


Leu 
410 


Leu 


Phe 


Ser 


Lys 


Val liy 


Thr 
425 


uy t> 


Thr 


Thr 


ni ii 


VJiy mo rVo^J 


fil v 
oiy 

440 


Tl e 

11C 


v a i 


T.oi i 

UCU 


Lys 


bcU iyi OCX 


f2l v 

v?iy 
455 


ocx 


Al a 
nla 




Tl e 


nsll UCU 


Gin 
470 


xjy o 


Val 


Asn 


Pro 


Val Pvres Tn-r 
Val v^yo XXIX 


T 01 1 

485 


val 


Car 
Ocl 


Car 

OC1 


Ser* 


Lcll liyS nla 


Tie 
lie 

500 


Lys 


Val 


irp 


Lys 


lieu L/ys Juy» 


olU 

515 


T.aii 
JjcU. 


Th-r 
1 ill 


oiy 


Leu 


XTst 1 lis Ala 
Val AA.SL Ala 


r«l rt 

530 


Ser 


Tyr 


Leu 


Tl o 
lie 


Liy o lie lX^' 


Asp 
545 


Tip 
lie 


Arg 


Thr 

1 XIX 


ulil 


TP Vl V" Car 1 \/ 
1 111 OCX uiy 


rsl vr 
oiy 

560 


Ocl 


Val 

Val 


l^X 


nio 


lie val Lys 


\j iy 
575 


x in 


lyr 


Ri \i 

Vjl u 


yip 
lie 


fTllii Cot" T \/c 

vjiu ocx ijy o 


nl n 

blU 

590 


fit n 

111 


Val 
Val 


A rrr 

Aig 


Thr 


Val Tvr Ala 
vox iyi nxa 


Leu 
605 


Ala 


Val 


He 


val 


Phe Ser Ala 


Ser 
620 


Tyr 


Asp 


Arg 


Asp 


Asn Met He 


Cys 
635 


Thr 


Gin 


Thr 


Val 


Thr Ala Leu 


Ala 
650 


Val 


Ser 


Arg 


Val 


Asp Ser Thr 


Val 


Lys 


Val 


Trp 



665 
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280 






285 


Thr Asp 


Asp Arg 


Phe 


His Glu 




295 






300 


Asp Gin 


Glu He 


Ala 


Phe Leu 




310 






315 


Glu Lys 


He Asp Gin 


Leu Glu 




325 






330 


Val 


Leu 


Asp Glu Asn 


Gin Ser 




340 






345 


Phe Arg 


Arg Asp Ala 


Ser Met 




355 






360 


Asn Ala 


ni y Lieu 


noil 


Met Gly 




J f V 






375 


ulll 


Tie 

1 1C 


Phe Lys 


Cys 


Lvs Glv 




■a ot: 






390 


Val 


Trn 


Cys Leu 


Cys 


Val Tyr 




Ann 






405 


f*l \r 

vjiy 


Ser 


Ser Asp 


Lys 


Thr He 




Al 
413 






420 


Tyr 


Lys 


Cys Gin 


Lys 


Thr Leu 




4.5U 






435 


Aia 


Leu 


Cys He 


Gin 


Gly Cys 




/I /I c 






450 


cys 


Thr 


He He 


Val 


Trrj Aso 










465 


Thr 


T 1 a 

lie 


Arg Ala 


His 






yi -l r 






480 


HIS 


Asn 


Val Leu 


Phe 


Ser Gly 




4y o 






495 


Asp 


lie 


Val Gly Thr 


(2 1 1 1 T .01 1 
VJIU XjC u. 




C A C 






510 


Leu 


Asn 


His Trp Val 


T\ rrr Ala 

Aig Ala 




520 






525 


Tyr 


Ser 


Gly Ser 


Tyr 


Gin Thr 




535 






540 


Leu 


Asp 


Cys He 


His 


Va 1 Leu 




550 






555 


Ser 


He 


Ala Val 


Thr 


A^n Hi q 




565 






570 


Asn 


Leu 


He His 


Val 


Trp Asp 




580 






585 


Thr 


Leu 


Thr Gly His 


Val filv 

Val oxy 




595 






600 


Ser 


Thr 


Pro Asp Gin 


Thr Lys 




610 






615 


Ser 


Leu 


Arg Val 


Trp 


Ser Met 




625 






630 


Leu 


Leu 


Arg His 


Gin 


Ser Ser 




640 






645 


Gly Arg 


Leu Phe 


Ser 


Gly Ala 




655 






660 


Thr 


Cys 










670 












Met Lys Val 
1 

Glu Thr Lys 

Ala Leu His 

Asn Ala Thr 

Ser Leu Asp 

Pro Glu Lys 

Val Arg He 

Gin Ala His 

Gly Thr Ser 

Trp Lys Met 

His Thr He 

Trp Lys Glu 

Trp Asp Glu 

Phe Asp Ser 

Leu Leu Gly 

Met Arg Gin 

Thr Asn Thr 

Ala Ala Asn 

Leu Asp Thr 

Leu Asp Val 

Ser Phe Asp 

Ser Arg Glu 

Val Lys Trp 

Glu Met Asn 

Gly Val Leu 

Lys Leu Lys 

Ala Arg His 

Glu Gin Arg 

Arg He Lys 

Lys Lys Lys 
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Leu 
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Leu 


Asp 


T .01 1 




20 




Pro 


Phe 
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Lys 


Leu 


Ui. Li 
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Gly His 


Arg 
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Leu 


Ala 


inr 




80 




Trp 


Asn 


Leu 
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Glu Gly 


rile 




110 




Phe 


Phe 


Thr 




125 




Asp Gly 


Pro 




140 




Leu Gly 


Lys 




155 




Ala 


Val 


pne 




170 




Gin 


Arg 


Thr 




185 




He 


Ser 


Ser 




200 




Ser 


Cys 


Ala 




215 




Ala 


Thr 


Pro 




230 




He 


Cys 


Trp 




245 




Glu Asp 


Tyr 




260 




Pro 


Val 


Met 




275 




Asp Tyr 


ser 




290 




Lys 


Ser 


lie 




305 




Val 


Tyr 


His 




320 




Thr 


Ser 


Asp 




335 




He 


Arg 


Leu 




350 




Thr 


Ser 


Arg 




365 




Glu 


Lys 


r>V> /-^ 

rne 




380 




Arg 


His 


T m 1 

Leu 




395 




He 


Met 


Lys 




410 




His 


Ser 


Lys 




425 




His 


Val 


Val 




440 





Ser Arg Asn 
Gin Arg Val 
Val Pro Arg 
Arg Val Phe 
Asp Gly Val 
Val Leu Ser 
Thr Gin Arg 
Val Arg Gly 
Val Gly Asp 
Gly Tyr Gly 
Thr Val Tyr 
Ala Thr Cys 
Asn Pro He 
Val Lys Phe 
Ser Asp Arg 
Leu Lys Lys 
Asn Pro Met 
Asn Leu Tyr 
Val His Met 
Pro Thr Gly 
Arg He Phe 
Thr Lys Arg 
Ser Lys Tyr 
Trp Lys Ala 
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Gin His Tyr 
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Pro Gly Ser 
Ala Val Val 



Pro 


Asp Asn 


± u 






Pro 


Arg 


Asn 


£. D 








Tyr 


Val 










Lys 


Pro 


D 0 






Asn 


Cys 


Leu 


l u 








Ala 


Cys 


Q C 






Asn 


Cys 
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1 no 
1UU 








Cys 


Thr 


11D 






Asp 


Lys 


Thr 


n "in 

iju 






Asp 


Glu 


Glu 


1 4D 






Thr 


Gly He 


1 bO 






Gly 


Gin 


Gin 


1 /D 






Cys 


Ser 


Met 


1 OA 

iyo 






Asn 


Pro 


He 








Asn 


He 


Val 


220 






val 


He 


Leu 


235 






pi,. 


Ala 


Phe 


Z OU 






T.nr 


Phe 


Asp 


Z. DO 






Asp 


His 


Val 


£ OU 






Lys 


Glu 


Phe 








Pro 


Val 


Asp 


■5 1U 






Mec 


Gin 


His 








lie 


Met 


Cys 


"2 a n 






Asn 


Ala 


Ser 


ODD 






nla 


Lys 


Asp 


0 / u 






Pro 


His 


He 


"IOC 

OOD 






Tl ^ 

JL J. tr 


Tyr 


Ser 








Arg 


Arg Lys 


415 






Val 


Pro 


Leu 


430 






Lys 






445 







Tyr Val Arg 
15 

Tyr Asp Pro 
30 

Arg Ala Leu 
45 

Phe Leu Ala 
60 

Ala Lys His 
75 

Asp Gly Glu 
90 

Arg Thr He 
105 

Arg Phe Cys 
120 

Val Lys Gin 
135 

Glu Pro Leu 
150 

Asp His His 
165 

Val Asp He 
180 

Thr Trp Gly 
195 

Glu Thr Phe 
210 

Leu Tyr Asp 
225 

Asp Met Arg 
240 

He Phe Thr 
255 

Met Arg Ala 
270 

Ser Ala Val 
285 

Val Ser Ala 
300 

Lys Ser Arg 
315 

Val He Cys 
330 

Gly Ser Asp 
345 

Glu Lys Leu 
360 

Tyr Asn Gin 
375 

Lys Arg He 
390 

Gin lie Gin 
405 

Glu Val Asn 
420 

Val Ser Glu 
435 
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Met Ser Asp 
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Leu 


Ser 


Pro 
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Val Asp Ala Phe 
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Cys 


Phe 


Ala Ser 
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Gin Asn Phe 


Glu 


Glu 


Gin 


Met 


Gin Gly 






35 








Gin Leu Ser 


Thr 


Leu 


Leu 


Arg 


Leu Leu 
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Tyr Leu Glu 


Ser 


Gin 
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Ser 


Gly Tyr 






65 








Trp Leu Leu 


He 


Arg 


Phe 


Lys 


Arg Glu 






80 








Leu Arg Leu 


Trp 


Glu 


Val 


Met 


Trp Thr 






95 








Phe His Leu 


Leu 


Leu 


Cys 


Cys 


Ala He 






110 








Gin He Met 


Glu 


Lys 


His 


Tyr 


Gly Phe 
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He Asn Glu 


Leu 


Ser 


Met 


Lys 


lie Asp 






140 








Lys Ala Glu 


Ala 


He 


Ser 


Leu 


Gin Met 
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Pro Gin Ala 


Val 


Cys 


Glu 


He 


Leu Gly 






170 








Thr Thr Pro Asp 


Ser 


Asp 


Val 


Gly Glu 






185 








Thr Pro Cys 


Pro 


Thr 


Ser 


Ala 


Phe Gin 






200 








Leu Ser Ala 


Ser 


Gly 


Ala 


Arg 


Asn Asp 






215 








Val Ser Ser Asp 


Val 


Cys 


Arg 


Leu Thr 
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Met Gin Glu 


Ser 


Gly 


Cys 


Arg 


Leu Glu 


1 




5 








Phe Arg Asn 


His 


Val 


Met 


Glu 


Gly Asp 






20 








Asp Leu Asn 


Glu 


Leu 


Lys 


Pro 


Leu Val 






35 








Val Arg Met 


Lys 


Phe 


Leu 


Leu 


Leu Gin 






50 








Leu Glu Asp 


Gly 


Lys 


Val 


Leu 


Glu Ala 
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Glu Leu Thr 


Pro 


Leu 


Lys 


Tyr 


Asn Thr 






80 








Ser Gly Tyr 


Leu 


Met 


Cys 


Ser 


His Ala 






95 
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Tyr Val Met 


Glu 


Asn 


Glu 


10 






15 


Tyr Met Asp 


Gin 


Met 


His 


25 






30 


Met Lys Thr 


Gin 


Leu 


He 


40 






45 


Asp Ser Gly 


Phe 


Cys 


Ser 


55 






60 


Leu Tyr Phe 


Cys 


Phe 


Arg 


70 






75 


Phe Ser Phe 


Leu 


Asp 


He 


85 






90 


Glu Leu Pro 


Cys 


Thr 


Asn 


100 






105 


Leu Glu Ser 


Glu Lys 


Gin 


115 






120 


Asn Glu lie 


Leu 


Lys 


His 


130 






135 


Val Glu Asp 


lie 


Leu Cys 


145 






150 


Val Lys Cys 


Lys 


Glu 


Leu 


160 






165 


Leu Gin Gly 


Ser 


Glu 


Val 


175 






180 


Asp Glu Asn 


Val 


Val 


Met 


190 






195 


Ser Asn Ala 


Leu 


Pro 


Thr 


205 






210 


Ser Pro Thr 


Gin 


He 


Pro 


220 






225 


Pro Ala 








235 









His Pro Ser 


Ala 


Thr 


Lys 


10 






15 


Trp Asp Lys 


Ala 


Glu 


Asn 


25 






30 


His Ser Pro 


His 


Ala 


lie 


40 






45 


Gin Lys Tyr 


Leu 


Glu 


Tyr 


55 






60 


Leu Gin Val 


Leu 


Arg 


Cys 


70 






75 


Glu Arg lie 


His 


Val 


Leu 


85 






90 


Glu Asp Leu 


Arg Ala Lys 


100 






105 

Ala Glu Trp Glu Gly Lys Gly Thr Ala Ser Arg Ser Lys Leu Leu

110 115 120

Asp Lys Leu Gln Thr Tyr Leu Pro Pro Ser Val Met Leu Pro Pro

125 130 135

Arg Arg Leu Gln Thr Leu Leu Arg Gln Ala Val Glu Leu Gln Arg

140 145 150

Asp Arg Cys Leu Tyr His Asn Thr Lys Leu Asp Asn Asn Leu Asp

155 160 165

Ser Val Ser Leu Leu Ile Asp His Val Cys Ser Arg Arg Gln Phe

170 175 180

Pro Cys Tyr Thr Gln Gln Ile Leu Thr Glu His Cys Asn Glu Val

185 190 195

Trp Phe Cys Lys Phe Ser Asn Asp Gly Thr Lys Leu Ala Thr Gly

200 205 210

Ser Lys Asp Thr Thr Val Ile Ile Trp Gln Val Asp Pro Asp Thr

215 220 225

His Leu Leu Lys Leu Leu Lys Thr Leu Glu Gly His Ala Tyr Gly

230 235 240

Val Ser Tyr Ile Ala Trp Ser Pro Asp Asp Asn Tyr Leu Val Ala

245 250 255

Cys Gly Pro Asp Asp Cys Ser Glu Leu Trp Leu Trp Asn Val Gln

260 265 270

Thr Gly Glu Leu Arg Thr Lys Met Ser Gln Ser His Glu Asp Ser

275 280 285

Leu Thr Ser Val Ala Trp Asn Pro Asp Gly Lys Arg Phe Val Thr

290 295 300

Gly Gly Gln Arg Gly Gln Phe Tyr Gln Cys Asp Leu Asp Gly Asn

305 310 315

Leu Leu Asp Ser Trp Glu Gly Val Arg Val Gln Cys Leu Trp Cys

320 325 330

Leu Ser Asp Gly Lys Thr Val Leu Ala Ser Asp Thr His Gln Arg

335 340 345

Ile Arg Gly Tyr Asn Phe Glu Asp Leu Thr Asp Arg Asn Ile Val

350 355 360

Gln Glu Asp His Pro Ile Met Ser Phe Thr Ile Ser Lys Asn Gly

365 370 375

Arg Leu Ala Leu Leu Asn Val Ala Thr Gln Gly Val His Leu Trp

380 385 390


Acn Leu 


Gin 


Asp 


Ara 


Val 


Leu 


Val Arg 


Lys 


Tyr Gin Gly 


Val Thr 








395 








400 






405 


Gln Gly Phe Tyr Thr Ile His Ser Cys Phe Gly Gly His Asn Glu

410 415 420

Asp Phe Ile Ala Ser Gly Ser Glu Asp His Lys Val Tyr Ile Trp

425 430 435

His Lys Arg Ser Glu Leu Pro Ile Ala Glu Leu Thr Gly His Thr

440 445 450

Arg Thr Val Asn Cys Val Ser Trp Asn Pro Gln Ile Pro Ser Met

455 460 465

Met Ala Ser Ala Ser Asp Asp Gly Thr Val Arg Ile Trp Gly Pro

470 475 480

Ala Pro Phe Ile Asp His Gln Asn Ile Glu Glu Glu Cys Ser Ser

485 490 495



Met Asp Ser 

<210> 29 

<211> 334 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1331955CD1 
<400> 29



Mo f- Ala T"h r 
UcL nld. illl. 


fil u 
ox u 


ftl n 


Lys 


Lys 


Pro 


Glu 


Thr 


Glu 


Ala Ala Arg 


Ala 


1 
X 




5 










10 






15 


VjJ.il JriO 1 XIX 


pro 


OCX 


Ser 


OCX 


Ala 


Thr 


Gin 


Ser Lys Pro Thr 


Pro 






20 










25 






30 


T T—% T T >m Or<"\ 

VaX JjyS irxu 


Asn 


ryr 


Ala 


Leu 


Lys 


Phe 


Thr 


Leu Ala Gly His 


Thr 




35 










40 






45 


T vc Ms* Va 1 
X»yS Ala VaX 


Cor 


Cpr 
OCX 


Val 


Lys 


Phe 


Ser 


Pro 


Asn Gly Glu Trp 


Leu 




50 










55 






60 


^1 a Oo r Car 
Ala ocr ocl 


Cor 

OCX 


TV 1 a 
Ala 




uy a 


Leu 


He 


Lys 


He Trp Gly Ala 


xy j- 






65 










70 






75 


Ah>p oXy J-0r» 




OX Li 




Thr 


He 


Ser 


Glv 
oxy 


His 


Lys Leu Gly 


He 






80 










85 






90 


Cor len Val 
Del ASp VaX 


Al a 
nld 


T*rr> 


OCX 


Cor 

OCX 


Asp 


Ser 


Asn 


Leu 


Leu Val Ser 


Ala 






^ -/ 










100 






105 


Cor TA c t^v ftert 
ocX nSp A5u? 


Lys 


Thr 

J. I IX 


Leu 


x»y o 


Tl^ 

X xc 


""Pro 




Val 


Ser Ser Gly 


Lys 






110 










115 






120 


r^A/"0 T on T v e 

tys ircu x>ys» 


1 11X 




±jy a 


oxy 


His 


Ser 


Asn 


Tyr Val Phe Cys 


Cys 






195 
xz _> 










130 






135 


Asn Phe Asn 


Pro 


uin 


Cor 

ocr 


Asn 


Leu 


Tl o 
X Xc 


Val 
VaX 


Ser Gly Ser Phe 








1 4(1 

X «i C 










i'j 






150 


Glu Ser Val 


Arg 


lie 


m 

Trp 


Asp 


Val 


Lys 


Thr 

i nx 


Gly Lys Cys Leu 


jjy 0 






1 










i fin 

X u u 






1 65 

X O J 


Thr Leu Pro 


Ala 


rlxS 


Ser 


Asp 


Pro 


Val 


Ser 


Ala 


Val His Phe 


Asn 






X / U 










X / J 






X O \J 


Arg Asp Gly Ser 


Leu 


Tl « 

lie 


VaX 


Ser 


Ser 


Cor 
OCX 


Tyr Asp Gly Leu 








1 R5 

lOJ 










1 






195 


Arg He Trp Asp 


inr 


71 1 a 

Ala 


oer 


Gly Gin 


Cys 


Leu 


Lys Thr Leu 


Tl 0 
lie 






z u u 










9 D5 

Z U _> 






91 0 
Z X u 


Asp Asp Asp Asn 




Pro 


VaX 


Ser 


Phe 


Val 
val 


Lys 


Phe Ser Pro 








91 5 










220 






225 


Gly Lys Tyr 


He 


Leu 


Zvl a 
Ala 


7a 1 a 
Ala 


Thr 


Leu 


Abp 


Asn 


Thr Leu Lys 


XiCU 






9in 










jj _) 






240 


Trp Asp Tyr 


Ser 


Lys 


oiy 


Lys 


Cys 


Leu 


Lys 


Thr 


Tyr Thr Gly 








Z f± J 










9 5n 






955 

£t J J 


Lys Asn Glu 


Lys 


Tyr 


Cys 


Tl o 
x xc 


Phe 


Ala 


Asn 


Phe 


Ser Val Thr 


oxy 






Z DU 










9 fi5 






97 D 

z / V 


Gly Lys Trp 


He 


Val 
VaX 


Ser 


oiy 


Ser 


Glu 


Asp 


Asn 


Leu Val Tyr 


Tl 0 
X X c 






275 










280 






285 


Trp Asn Leu 


Gin 


Thr 


Lys 


Glu 


He 


Val 


Gin 


Lys 


Leu Gin Gly 


His 






9 Qfl 










9 Q5 

Z .7 D 








Thr Asp Val Val 


Tl o 
X xc 


Cor 

OCX 


i 11X 


Ala 


Cys 


nio 


Pro 


Thr Glu Asn 


Tl 0 

X -L C 






*}05 










^10 

Jlw 






315 


He Ala Ser 


Ala 


Ala 


Leu 


Glu 


Asn Asp 


Lys 


Thr 


He Lys Leu 


Trp 






320 










325 






330 


Lys Ser Asp Cys 




















<210> 30

<211> 292

<212> PRT

<213> Homo sapiens



<220>

<221> misc_feature

<223> Incyte ID No: 1412614CD1



<400> 30






















Met Met Ala 


Phe 


Ala 


Pro 


Pro 


Lys 


Asn 


Thr 


Asp Gly Pro Lys 


Met 


1 




5 










10 






15 


Gin Thr Lys 


Met 


Ser 


Thr 


Trp 


Thr 


Pro 


Leu 


Asn 


His Gin Leu 


Leu 






20 










25 






30 


Asn Asp Arg Val 


Phe 


Glu Glu Arg Arg 


Ala 


Leu Leu Gly Lys 


Trp 






35 










40 






45 
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Phe Asp 


Lys 


Trp 


Thr Asp 


Ser 


Gin Arg 


Arg 


Arg 


He Leu Thr 


Gly 








50 










55 






60 


Leu Leu 


Glu Arg 


Cys 


Ser 


Leu 


Ser 


Gln


Gln


Lys 


Phe Cys Cys 


Arg 








65 










70 






75 


Lys Leu 


Gln


Glu 


Lys 


Ile


Pro 


Ala 


Glu 


Ala 


Leu 


Asp Phe Thr 


Thr 






80 










85 






90 


Lys Leu 


Pro 


Arg 


Val 


Leu 


Ser 


Leu 


Tyr 


Ile


Phe 


Ser Phe Leu 


Asp 






95 










100 






105 


Pro Arg 


Ser 


Leu 


Cys Arg 


Cys 


Ala 


Gln


Val 


Cys Trp His Trp 


Lys 






110 










115 






120 


Asn Leu 


Ala 


Glu 


Leu Asp 


Gln


Leu Trp 


Met 


Leu Lys Cys Leu 


Arg 








125










130 






135 


Phe Asn 


Trp Tyr 


Ile


Asn 


Phe 


Ser 


Pro 


Thr 


Pro 


Phe Glu Gln


Gly 








140 










145 






150 


Ile Trp


Lys 


Lys 


His 


Tyr 


Ile


Gln Met


Val 


Lys 


Glu Leu His 


Ile








155 










160 






165 


Thr Lys 


Pro 


Lys 


Thr 


Pro 


Pro 


Lys Asp 


Gly 


Phe 


Val Ile Ala


Asp 








170 










175 






180 


Val Gln


Leu 


Val


Thr 


Ser 


Asn 


Ser 


Pro 


Glu 


Glu 


Lys Gln Ser


Pro 








185 










190 






195 


Leu Ser 


Ala 


Phe 


Arg 


Ser 


Ser 


Ser 


Ser 


Leu 


Arg Lys Lys Asn 


Asn 








200 










205 






210 


Ser Gly 


Glu 


Lys 


Ala 


Leu 


Pro 


Pro Trp 


Arg 




Ser Asp Lys


His 








215 










220 






225 


Pro Thr 


Asp 


Ile


Ile


Arg 


Phe 


Asn 


Tyr 


Leu 


Asp 


Asn Arg Asp 


Pro 








230 










235 






240 


Met Glu 


Thr 


Val 


Gln


Gln


Gly 


Arg 


Arg 


Lys 


Arg 


Asn Gln Ile


Thr 








245 










250 






255 


Pro Asp 


Phe 


Ser 


Arg 


Gln


Ser 


His 


Asp 


Lys 


Lys 


Asn Lys Leu 


Gln








260 










265 






270 


Asp Arg 


Thr Arg 


Leu Arg 


Lys 


Ala 


Gln


Ser 


Met 


Met Ser Arg 


Arg 








275 










280 






285 


Asn Pro 


Phe 


Pro 


Leu Cys 


Pro 




















290 


















<210> 31 






















<211> 588 






















<212> PRT 






















<213> Homo sapiens 


















<220> 
























<221> misc_feature 


















<223> Incyte ID No: 1750781CD1












<400> 31 






















Met Ser 


Ser 


Gly 


Leu 


Arg 


Ala 


Ala Asp 


Phe 


Pro 


Arg Trp Lys 


Arg 


1 






5 










10 






15 


His Ile


Ser 


Glu 


Gln


Leu 


Arg 


Arg 


Arg 


Asp 


Arg 


Leu Gln Arg


Gln








20 










25 






30 


Ala Phe 


Glu 


Glu 


Ile


Ile


Leu 


Gln


Tyr 


Asn 


Lys 


Leu Leu Glu 


Lys 








35 










40 






45 


Ser Asp 


Leu 


His 


Ser 


Val 


Leu 


Ala 


Gln


Lys 


Leu 


Gln Ala Glu


Lys 








50 










55 






60 


His Asp 


Val 


Pro 


Asn Arg 


His 


Glu 


Ile


Ser 


Pro Gly His Asp 


Gly 








65 










70 






75 


Thr Trp 


Asn 


Asp 


Asn 


Gln


Leu 


Gln Glu


Met 


Ala 


Gln Leu Arg


Ile








80 










85 






90 


Lys His 


Gln


Glu 


Glu 


Leu 


Thr 


Glu 


Leu 


His 


Lys 


Lys Arg Gly 


Glu 






95 










100 






105 


Leu Ala 


Gln


Leu 


Val 


Ile


Asp 


Leu 


Asn 


Asn 


Gln Met Gln Arg


Lys 








110 










115 






120 


Asp Arg 


Glu 


Met 


Gln


Met 


Asn 


Glu Ala 


Lys 


Ile Ala Glu Cys


Leu 








125 










130 






135 


Gln Thr


Ile


Ser 


Asp 


Leu 


Glu 


Thr 


Glu 


Cys 


Leu 


Asp Leu Arg 


Thr
140 145 150

Lys Leu Cys Asp Leu Glu Arg Ala Asn Gln Thr Leu Lys Asp Glu

155 160 165

Tyr Asp Ala Leu Gln Ile Thr Phe Thr Ala Leu Glu Gly Lys Leu

170 175 180

Arg Lys Thr Thr Glu Glu Asn Gln Glu Leu Val Thr Arg Trp Met

185 190 195

Ala Glu Lys Ala Gln Glu Ala Asn Arg Leu Asn Ala Glu Asn Glu

200 205 210

Lys Asp Ser Arg Arg Arg Gln Ala Arg Leu Gln Lys Glu Leu Ala

215 220 225

Glu Ala Ala Lys Glu Pro Leu Pro Val Glu Gln Asp Asp Asp Ile

230 235 240

Glu Val Ile Val Asp Glu Thr Ser Asp His Thr Glu Glu Thr Ser

245 250 255

Pro Val Arg Ala Ile Ser Arg Ala Ala Thr Arg Arg Ser Val Ser

260 265 270

Ser Phe Pro Val Pro Gln Asp Asn Val Asp Thr His Pro Gly Ser

275 280 285

Gly Lys Glu Val Arg Val Pro Ala Thr Ala Leu Cys Val Phe Asp

290 295 300

Ala His Asp Gly Glu Val Asn Ala Val Gln Phe Ser Pro Gly Ser

305 310 315

Arg Leu Leu Ala Thr Gly Gly Met Asp Arg Arg Val Lys Leu Trp

320 325 330

Glu Val Phe Gly Glu Lys Cys Glu Phe Lys Gly Ser Leu Ser Gly

335 340 345

Ser Asn Ala Gly Ile Thr Ser Ile Glu Phe Asp Ser Ala Gly Ser

350 355 360

Tyr Leu Leu Ala Ala Ser Asn Asp Phe Ala Ser Arg Ile Trp Thr

365 370 375

Val Asp Asp Tyr Arg Leu Arg His Thr Leu Thr Gly His Ser Gly

380 385 390

Lys Val Leu Ser Ala Lys Phe Leu Leu Asp Asn Ala Arg Ile Val

395 400 405

Ser Gly Ser His Asp Arg Thr Leu Lys Leu Trp Asp Leu Arg Ser

410 415 420

Lys Val Cys Ile Lys Thr Val Phe Ala Gly Ser Ser Cys Asn Asp

425 430 435

Ile Val Cys Thr Glu Gln Cys Val Met Ser Gly His Phe Asp Lys

440 445 450

Lys Ile Arg Phe Trp Asp Ile Arg Ser Glu Ser Ile Val Arg Glu

455 460 465

Met Glu Leu Leu Gly Lys Ile Thr Ala Leu Asp Leu Asn Pro Glu

470 475 480

Arg Thr Glu Leu Leu Ser Cys Ser Arg Asp Asp Leu Leu Lys Val

485 490 495

Ile Asp Leu Arg Thr Asn Ala Ile Lys Gln Thr Phe Ser Ala Pro

500 505 510

Gly Phe Lys Cys Gly Ser Asp Trp Thr Arg Val Val Phe Ser Pro

515 520 525

Asp Gly Ser Tyr Val Ala Ala Gly Ser Ala Glu Gly Ser Leu Tyr

530 535 540

Ile Trp Ser Val Leu Thr Gly Lys Val Glu Lys Val Leu Ser Lys

545 550 555

Gln His Ser Ser Ser Ile Asn Ala Val Ala Trp Ser Pro Ser Gly

560 565 570

Ser His Val Val Ser Val Asp Lys Gly Cys Lys Ala Val Leu Trp

575 580 585

Ala Gln Tyr

<210> 32 
<211> 326
<212> PRT

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1821658CD1 

<400> 32 



Met Lys 


Gln Asp


Ala 


Ser 


Arg 


Asn Ala 


Ala Tyr 


Thr 


Val 


Asp 


Cys 


1 




5 








10 








15 


Glu Asp 


Tyr Val 


His


Val


Val


Glu Phe 


Asn Pro 


Phe Glu Asn Gly 


20 








25 








30 


Asp Ser 


Gly Asn 


Leu 


Ile


Ala 


Tyr Gly Gly Asn 


Asn Tyr Val 


Val 






35 








40 








45 


Ile Gly


Thr Cys 


Thr 


Phe 


Gln


Glu Glu 


Glu Ala 


Asp Val Glu Gly 






50 








55 








60 


Ile Gln


Tyr Lys 


Thr 


Leu 


Arg 


Thr Phe 


His His 


Gly Val Arg Val 






65 








70 








75 


Asp Gly 


Ile Ala


Trp 


Ser 


Pro 


Glu Thr Arg Leu 


Asp 


Ser 


Leu 


Pro 






80 








85 








90 


Pro Val 


Ile Lys


Phe 


Cys 


Thr 


Ser Ala 


Ala Asp 


Met 


Lys 


Ile


Arg 






95 








100 








105 


Leu Phe 


Thr Ser 


Asp 


Leu 


Gln


Asp Lys 


Asn Glu 


Tyr 


Lys 


Val 


Leu 






110 








115 








120 


Glu Gly 


His Thr 


Asp 


Phe 


Ile


Asn Gly Leu Val 


Phe 


Asp 


Pro 


Lys 






125 








130 








135 


Glu Gly 


Gln Glu


Ile


Ala 


Ser 


Val Ser Asp Asp 


His 


Thr 


Cys 


Arg 






140 








145 








150 


Ile Trp


Asn Leu 


Glu 


Gly 


Val 


Gln Thr


Ala His 


Phe 


Val 


Leu 


His 




155 








160 








165 


Ser Pro 


Gly Met 


Ser 


Val 


Cys 


Trp His 


Pro Glu 


Glu 


Thr 


Phe 


Lys 






170 








175 








180 


Leu Met 


Val Ala 


Glu 


Lys 


Asn 


Gly Thr 


Ile Arg


Phe 


Tyr 


Asp 


Leu 






185 








190 








195 


Leu Ala 


Gln Gln


Ala 


Ile


Leu 


Ser Leu 


Glu Ser 


Glu 


Gln


Val 


Pro 






200 








205 








210 


Leu Met 


Ser Ala


His


Trp 


Cys 


Leu Lys 


Asn Thr 


Phe 


Lys 


Val 


Gly 






215 








220 








225 


Ala Val 


Ala Gly 


Asn 


Asp 


Trp 


Leu Ile


Trp Asp 


Ile


Thr 


Arg 


Ser 






230 








235 








240 


Ser Tyr 


Pro Gln


Asn 


Lys 


Arg 


Pro Val 


His Met 


Asp Arg Ala Cys 






245 








250 








255 


Leu Phe 


Arg Trp 


Ser 


Thr 


Ile


Ser Glu 


Asn Leu 


Phe 


Ala 


Thr 


Thr 






260 








265 








270 


Gly Tyr 


Pro Gly 


Lys 


Met 


Ala 


Ser Gln


Phe Gln


Ile


His 


His 


Leu 






275 








280 








285 


Gly His 


Pro Gln


Pro 


Ile


Leu 


Met Gly Ser Val 


Ala 


Val 


Gly 


Ser 






290 








295 








300 


Gly Leu 


Ser Trp 


His 


Arg 


Thr 


Leu Pro 


Leu Cys 


Val 


Ile


Gly Gly 






305 








310 








315 


Asp His 


Lys Leu 


Leu 


Phe 


Trp 


Val Thr 


Glu Val 











320 325 

<210> 33 
<211> 453 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1872574CD1 

<400> 33 

Met Ala Arg Lys Val Val Ser Arg Lys Arg Lys Ala Pro Ala Ser
1 5
Pro Gly Ala Gly Ser Asp Ala Gln
20 

His Ser Leu His Lys Arg Lys Arg 
35 

Leu Val Tyr Tyr Leu Lys Asn Arg 
50 

Thr Ser Tyr Ser Arg Val Leu His 
65 

Pro Ser Leu Leu Lys Glu Arg Glu 
80 

Lys Val Phe Ala Ser Gln Trp Leu
95 

Gly Thr Lys Cys Asn Thr Leu Phe 
110 

Gln Ile Thr Lys Ile Pro Ile Leu
125 

Val Thr Gln Gln Gly Cys Gly Ile
140 

Ser Arg Thr Leu Leu Ala Thr Gly 
155 

Ala Ile Tyr Arg Leu Pro Thr Leu
170 

Asp Gly His Lys Asp Trp Ile Phe
185 

Thr Met Ala Val Ser Gly Ser Arg 
200 

Glu Val Thr Asp Asp Val Leu Thr 
215 

Val Ser Arg Val Pro Val Tyr Ala 
230 

Lys Asp Ile Pro Lys Glu Asp Thr
245 

Arg Ala Leu Ala Phe Asn Asn Lys 
260 

Ser Leu Asp Gly Tyr Phe His Leu 
275 

Ser Lys Leu Leu Ser Thr Lys Leu 
290 

Cys Leu Ala Tyr Gly Ser Glu Trp 
305 

Gln Ala His Val Ser Phe Leu Asp
320 

Val Lys Ser Val Cys Ser Arg Glu 
335 

Val Ser Phe Tyr Glu His Ile Ile
350 

Ser Leu Leu Phe Tyr Asp Ile Arg
365 

Arg Leu Ser Ala Cys Tyr Gly Ser 
380 

Asn Leu Lys Leu Thr Thr Gly Lys 
395 

Thr Trp Arg Asn Tyr Phe Ser Asp 
410 

Val Tyr Thr His Cys Tyr Asp Ser 
425 

Ala Gly Gly Pro Leu Pro Ser Gly 
440 

Leu Trp Ser 

<210> 34 
<211> 161 





10 




15 


Gly
10




X -J 


Cys Leu 


Ile


Attt Ayrr Phe ThT 


Glu 




25 




30 


Asp Pro 


Thr 


Val Gly Val Asn


Phe 




40 




45 


Glu Pro 


Gly 


T.v^ Atct He Lvs 


Leu 




55 




60 


Gln Glu


Arg 


PVip At*o ^pr Tie 
rut. rix y «jcx lie 


Thr 




70 




75 


Val Gly Gly 


T.on T.oii T.on DVio 
JjcU Jjcu JJCU X lie 


Ala 




85 




90 


Gln Asn


Val 


nio uiu up ucu 


Glu 




100 




105 


Pro Tyr Gln


lie vox rxit; vox 


lie U 




115 




X £t \J 


Asp Thr Gln


nig Ijlil Vol 1X11 






130 




135 


Ala Ala 


Tyr 


Gly Met Lys Tyr 


Ile




145 




150 


Ile Asn


Val 


Glu Lys Ala Phe 


Thr 




160 




165 


Glu Leu 


Val 


Lys Arg Gly Glu 


Ile




175 




180 


Gly Val 


Lys 


Ser Gly Phe Val 


Pro 




190 




195 


Glu Val 


Val 


Lys Ser Glu Arg 


Arg 




205 




210 



<210> 39 

<211> 393 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3685151CD1 



<400> 39 








Met 


Glu 


Leu 


Val 


Ala Gly 


Cys 


1 








5 




Ala 


Val 


His 


Pro 


Glu Pro 
20 


Glu 


Trp 


Thr 


Leu 


Val 


Ala Asp 
35 


Phe 


Leu 


Ser 


Ala 


Val 


Ala Val 
50 


Asn 


Lys 


Asp 


Glu 


Thr 


Ile His
65 


Ile


His 


Gly 


Ala 


Leu 


Val His 
80 


His 


Phe 


Tyr 


Gly 


Asn 


Arg His 
95 


Leu 


Ile


Cys 


Ile


Trp 


Asp Ala 
110 


Lys 


Lys 


Ala 


His 


Lys 


Gly Gln
125 


Val 



Tyr Glu Gln


Val 


Leu 


Phe Gly 


Phe 


10 








15 


Ala Cys Gly 


Asp 


His 


Glu Gln


Gln


25 








30 


Thr His His 


Ala 


His 


Thr Ala 


Ser 


40 








45 


Ser Arg Phe 


Val Val Thr Gly 


Ser 


55 








60 


Tyr Asp Met 


Lys 


Lys 


Lys Ile


Glu 


70 








75 


Ser Gly Thr 


Ile


Thr 


Cys Leu 


Thr 


85 








90 


Ile Ser Gly


Ala Glu Asp Gly 


Leu 


100 








105 


Lys Trp Glu 


Ser 


Leu 


Thr Ser 


Ile


115 








120 


Thr Phe Leu 


Ser 


Ile


His Pro 


Ser 


130 








135
Gly Lys Leu Ala Leu Ser Val Gly Thr Asp Lys Thr Leu Arg Thr

140 145 150

Trp Asn Leu Val Glu Gly Arg Ser Ala Phe Ile Lys Asn Ile Lys

155 160 165

Gln Asn Ala His Ile Val Glu Trp Ser Pro Arg Gly Glu Gln Tyr

170 175 180

Val Val Ile Ile Gln Asn Lys Ile Asp Ile Tyr Gln Leu Asp Thr

185 190 195

Ala Ser Ile Ser Gly Thr Ile Thr Asn Glu Lys Arg Ile Ser Ser

200 205 210

Val Lys Phe Leu Ser Glu Ser Val Leu Ala Val Ala Gly Asp Glu

215 220 225

Glu Val Ile Arg Phe Phe Asp Cys Asp Ser Leu Val Cys Leu Cys

230 235 240

Glu Phe Lys Ala His Glu Asn Arg Val Lys Asp Met Phe Ser Phe

245 250 255

Glu Ile Pro Glu His His Val Ile Val Ser Ala Ser Ser Asp Gly

260 265 270

Phe Ile Lys Met Trp Lys Leu Lys Gln Asp Lys Lys Val Pro Pro

275 280 285

Ser Leu Leu Cys Glu Ile Asn Thr Asn Ala Arg Leu Thr Cys Leu

290 295 300

Gly Val Trp Leu Asp Lys Val Ala Asp Met Lys Glu Ser Leu Pro

305 310 315

Pro Ala Ala Glu Pro Ser Pro Val Ser Lys Glu Gln Ser Lys Ile

320 325 330

Gly Lys Lys Glu Pro Gly Asp Thr Val His Lys Glu Glu Lys Arg

335 340 345

Ser Lys Pro Asn Thr Lys Lys Arg Gly Leu Thr Gly Asp Ser Lys

350 355 360

Lys Ala Thr Lys Glu Ser Gly Leu Ile Ser Thr Lys Lys Arg Lys

365 370 375

Met Val Glu Met Leu Glu Lys Lys Arg Lys Lys Lys Lys Ile Lys

380 385 390

Thr Met Gln

<210> 40 
<211> 399 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4881515CD1 

<400> 40 

Met Ser Leu Gln Tyr Gly Ala Glu Glu Thr Pro Leu Ala Gly Ser

1 5 10 15

Tyr Gly Ala Ala Asp Ser Phe Pro Lys Asp Phe Gly Tyr Gly Val

20 25 30

Glu Glu Glu Glu Glu Glu Ala Ala Ala Ala Gly Gly Gly Val Gly

35 40 45

Ala Gly Ala Gly Gly Gly Cys Gly Pro Gly Gly Ala Asp Ser Ser

50 55 60

Lys Pro Arg Ile Leu Leu Met Gly Leu Arg Arg Ser Gly Lys Ser

65 70 75

Ser Ile Gln Lys Val Val Phe His Lys Met Ser Pro Asn Glu Thr

80 85 90

Leu Phe Leu Glu Ser Thr Asn Lys Ile Tyr Lys Asp Asp Ile Ser

95 100 105

Asn Ser Ser Phe Val Asn Phe Gln Ile Trp Asp Phe Pro Gly Gln

110 115 120

Met Asp Phe Phe Asp Pro Thr Phe Asp Tyr Glu Met Ile Phe Arg
125


Gly 


Thr 


Gly Ala 


Leu 








140 


Met 


Glu 


Ala Leu 


Thr 








155 


Lys


Val 


Asn Pro 


Asp 








170 


Asp 


Gly


Leu Ser


Asp 








185 


His 


Gln


Arg Ala 


Asn 








200 


Leu 


His 


Leu Ser 


Phe 








215 


Phe 


Glu 


Ala Phe


Ser 








230 


Pro 


Thr 




Asn 










Ile


Glu 


Lys Ala


Phe 








260 


Ala 


Thr 


Asp Ser 


Ser 








275 


Cys 


Asp 


Met Ile


Asp 








290 


Leu 


Lys 


Glu Asp


Gly








305


Ala 


Ile












320 


Val 


Thr 


Lys Phe 


Leu 








335


Phe 


Glu 


ax y juy a 


VJ X Jf 








350


Lys 


Ala 


Ile His


Glu 








365 


Ser 


Cys 


Gly His 


Gln








380 


His 


Asn 


Gly Thr 


Pro








395 



<210> 41 

<211> 412 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 

<400> 41 



Met 


Ala 


Gly 


Ser 


Val 


1 








5 


Val 


Arg 


Gly 


Gly 


Ser 
20 


Asp 


Asp 


Asp 


Ser 


Leu 
35 


Lys 


Ser 


Gin 


Glu 


Asn 
50 


Ser 


Gly 


Ala 


Ile


Leu 
65 


Phe 


Ala 


Leu 


Thr 


Asp 
80 


Lys 


Pro 


Trp 


Gln


Cys 
95 


Thr 


Ala 


Leu 


Thr 


Phe 
110 











X J \J 


Ile


Tyr 


Val 


Ile


Asp 
145 


Arg 


Leu 


His 


Ile


Thr 
160 


Met 


Asn 


Phe 


Glu 


Val 
175 


Asp 


His 


Lys 


Ile


Glu 
190 


Asp 


Asp 


Leu 


Ala 


Asp 
205 


Tyr 


Leu 


Thr 


Ser 


Ile
220 


Lys 


Val 


Val 


Gln


Lys 
235


Leu 


Leu 


Asn 


Ile


Phe 
250


Leu 


Phe 


Asp 


Val 


Val 

265


Pro 


Val 


Asp 


Met 


Gln
280


Val 


Val 


Ile


Asp 


Val 

295


Ser 


Gly 


Ser 


Ala 


Tyr 

310


Asn 


Asn 


Thr 


Thr 


Val 

325


Ala 


Leu 


Val 


Cys 


Ile

340


Leu 


Ile


Asp 


Tyr


Asn 
355 


Val 


Phe 


Glu 


Val


Gly 
370 


Thr 


Ser 


Ala 


Ser 


Ser 
385 


Arg 


Asn 


Ala 


Ile
Gly


Leu 


Ala 


Leu 


Cys 
10 


Arg 


Phe 


Leu 


Ala 


Thr 
25 


Phe 


Ile


Tyr 


Asp 


Cys 
40 


Lys 


Gly 


Glu 


Asp 


Ala 
55 


Ala 


Ser 


Thr 


Phe 


Ser 
70 


Asp 


Ser 


Lys 


Arg 


Leu 
85 


Leu 


Ser 


Val 


Arg 


Thr 
100 


Ile


Ala 


Ser 


Glu 


Glu 
115 






135 



Ala 


Gln


Asp 


Asp 


Tyr










150 


Val 


Ser 


Lys


Ala 


Tyr










165 


Phe 


Ile


His 


Lys 


Val 










180 


Thr 


Gln


Arg


Asp 


Ile










195 


Ala


fil V 
vaxy 


JJCU 


Glu 


Lys 










210 


Tyr


Asp 


His 


Ser 


Ile










225 




Ile


Pro 


Gln


Leu 










240 


Ile


OCX 


Asn 


Cor 


viy 










255 


Cor 
OCX 




Ile


xyx 


Ile










270 


Ser


Tyr 


ulU 




vy s 










285


oci 


Cys


Tip 
X XC 


Tyr 












300 


Asp


Lys 


VJJX Li. 


OCX 


Met










"315 

J X J 


Leu 


Tyr 


Leu 


Lys 


oiu 










330 


Leu 


Arg 


Glu 


Glu 


Ser 










345 


Phe 


His 


Cys 


Phe 


Arg 










360 


Val 


Thr 


Ser 


His 


Arg 










375 


T ,pi ^ 
■ucu 


T ,\rc 


Ala


Leu 


Thr 










390 


Gly


Gln


Thr 


Leu 


Val 










15 


Ser 


Ile


Ala 


Ser 


Ser 










30 


Ser 


Ala 


Ala 


Glu 


Lys 










45 


Pro 


Leu 


Asp 


Gln


Gly










60 


Lys


Cor 
OCX 


\j a. y 


Cor 
OCX 


xyi: 










75 


Ile


Leu 


Phe 


Arg 


Thr 










90 


Val 


Ala 


Arg 


Arg 


Cys 










105 


Lys 


Val 


Leu 


Val 


Ala 










120
Asp


Lys


Ser Gly 


Asn 


Val 


Tyr


Ser Phe 


Ser 


Val 


Leu Glu 


Pro 


His 










125 








130 








135 


Gly 


Cys 


Gly Arg 


Leu 


Glu 


Leu 


Gly His 


Leu 


Ser 


Met Leu 


Leu 


Asp










140 








145 








150 


Val 


Ala 


Val 


Ser 


Pro 


Asp 


Asp


Arg Phe 


Ile


Leu 


Thr Ala 


Asp 


Arg 










155 








160 








165 


Asp 


Glu 


Lys 


Ile


Arg 


Val 


Ser 


Trp Ala 


Ala 


Ala 


Pro His 


Ser 


Ile










170 








175 








180 


Glu 


Ser 


Phe 


Cys


Leu 


Gly


His 


Thr Glu 


Phe 


Val 


Ser Arg 


Ile


Ser 








Thr 


185 








190 








195 


Val 


Val 


Pro 


Gln


Pro 


Gly


Leu Leu 


Leu 


Ser 


Ser Ser 


Gly


Asp










200 








205 








210 


Gly 


Thr 


Leu 


Arg 


Leu 


Trp


Glu 


Tyr Arg


Ser 


Gly Arg Gln


Leu 


His 










215 








220 








225 


Cys


Cys


His 


Leu 


Ala 


Ser 


Leu 


Gln Glu


Leu 


Val 


Asp Pro 


Gln


Ala 










230 








235 








240 


Pro 


Gln


Lys 


Phe 


Ala 


Ala 


Ser 


Arg Ile


Ala 


Phe 


Trp Cys 


Gln


Glu 










245 








250 








255 


Asn 


Cys 


Val 


Ala 


Leu 


Leu 


Cys


Asp Gly


Thr 


Pro 


Val Val 


Tyr


Ile










260 








265 








270 


Phe 


Gln


Leu Asp 


Ala 


Arg 


Arg 


Gln Gln


Leu 


Val 


Tyr Arg 


Gln


Gln










275 








280 








285 


Leu 


Ala 


Phe 


Gln


His 


Gln


Val 


Trp Asp 


Val 


Ala 


Phe Glu 


Glu 


Thr 










290 








295 








300 


Gln


Gly


Leu 


Trp 


Val 


Leu 


Gln


nay vjr s 


Gln


Glu 


Ala Pro 


Leu 


Val 










305 








310 








315 


Leu 


Tyr 


Arg 


Pro 


Val 


Gly


Asp 


Gln Trp


Gln


Ser 


Val Pro 


Glu 


Ser 










320 








325 








330 


Thr 


Val 


Leu 


Lys 


Lys 


Val 


Ser 


Gly Val 


Leu 


Arg 


Gly Asn 




Ala 










335 








340 








345 


Met 


Leu 


Glu Gly 


Ser 


Ala 


Gly


Ala Asp 


Ala 


Ser 


Phe Ser 


Ser 


Leu 










350 








355 








360 


Tyr 


Lys 


Ala 


Thr 


Phe 


Asp 


Asn 


Val Thr 


Ser 


Tyr Leu Lys 


Lys 


Lys 










365 








370 








375 


Glu 


Glu 


Arg 


Leu 


Gln


Gln


Gln


Leu Glu 


Lys 


Lys 


Gln Arg


Arg 


Arg 










380 








385 








390 


Ser 


Pro 


Pro 


Pro 


Gly 


Pro 


Asp 


Gly His 


Ala 


Lys 


Lys Met 


Arg 


Pro 










395 








400 








405 


Gly 


Glu 


Ala 


Thr 


Leu 


Ser 


Cys 















410
<211> 163

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 5387651CD1 

<400> 42 



Met 


Asp 


Ala 


Leu 


Glu 


Gly 


Glu 


Ser Phe 


Ala 


Leu 


Ser Phe 


Ser 


Ser 


1 








5 








10 








15 


Ala 


Ser 


Asp 


Ala 


Glu 


Phe 


Asp 


Ala Val 


Val 


Gly Tyr Leu 


Glu 


Asp 










20 








25 








30 


Ile


Ile


Met 


Asp 


Asp 


Glu 


Phe 


Gln Leu


Leu 


Gln


Arg Asn 


Phe 


Met 










35 








40 








45 


Asp 


Lys 


Tyr 


Tyr 


Leu 


Glu 


Phe 


Glu Asp 


Thr 


Glu 


Glu Asn 


Lys 


Leu 










50 








55 








60 


Ile


Tyr 


Thr 


Pro 


Ile


Phe 


Asn 


Glu Tyr 


Ile


Ser 


Leu Val 


Glu 


Lys 










65 








70 








75 


Tyr 


Ile


Glu 


Glu 


Gln


Leu 


Leu 


Gln Arg


Ile


Pro 


Glu Phe 


Asn 


Met 










80 








85 








90 


Ala 


Ala 


Phe 


Thr 


Thr 


Thr 


Leu 


Gln His


His 


Lys 


Asp Glu 


Val 


Ala
95




100 








105 


Gly Asp Ile


Phe Asp 


Mpt" TiPii Tj£»u 

UCL UCU UvU 


Thr Phe


rpVi y A Q"0 

X J. IX AO £J 


Phe 


Leu 


Ala 






110 




115 








120 


Phe Lys Glu 


Met 


Phe 


Leu Asp Tyr 


Arg Ala 


Glu Lys 


Glu 


Gly 


Arg 






125 




130 








135 


Gly Leu Asp Leu 


Ser 


Ser Gly Leu 


Val Val 


Thr Ser 


Leu 


Cys 


Lys 






140 




145 








150 


Ser Ser Ser 


Leu 


Pro 


Ala Ser Gln


Asn Asn 


Leu Arg 


His 










155 




160 











<210> 43
<212> PRT
<220>

<221> misc_feature 

<223> Incyte ID No: 5595679CD1 

<400> 43 



Met 


Gln


Glu 


Ser 


Gly 


Cys 


Arg 


Leu 


Glu


His 


Pro Ser 


Ala 


Thr 


Lys 


1 








5 










10 








15


Phe


Arg 


Asn 


His 


Val 


Met 


Glu 


Gly 


Asp 


Trp 


Asp Lys 


Ala 


Glu 


Asn 










20 










25 








30


Asp 


Leu 


Asn 


Glu 


Leu 


Lys 


Pro 


Leu 


Val 


His 


Ser Pro 


His Ala


Ile










35 










40 










Val


Val


Arg 


Gly 


Ala 


Leu 


Glu 


Ile


Ser 


Gln


Thr Leu 


Leu 


Gly


Ile










50 










55 








60


Ile


Val


Arg 


Met 


Lys 


Phe


Leu 


Leu 


Leu 


Gln


Gln Lys


Tyr 


Leu 


Glu 










65










70 








75


Tyr 


Leu


Glu


Asp


Gly 


Lys


Val


Leu 


Glu 


Ala 


Leu Gln


Val


Leu 


Arg 










80










85 










Cys 


Glu


Leu 


Thr


Pro


Leu 


Lys 


Tyr 


Asn 


Thr 


Glu Arg 


Ile


His 


Val




















100 








105


Leu 


Ser 


vjiy 


Tyr 


Leu 


riec 


Cys 


Ser 


His 


Ala 


Glu Asp 


Leu 


Arg 


Ala










110










115 










Lys 


Ala 


bill 


Trp 


Glu


Gly


Lys 


taiy 


Thr 


Ala 


Ser Arg 


Ser 


Lys 


Leu 










125 










130 








135 


Leu 


Asp 


Lys 


Leu 


Gln


Thr 


Tyr 


Leu 


Pro 


Pro 


Ser Val 


Met 


Leu 


Pro 










140 










145 








150 


Pro 


Arg 


Arg 


Leu 


Gln


Thr 


Leu 


Leu 


Arg 


Gln


Ala Val 


Glu 


Leu 


Gln










155 










160 








165 


Arg 


Asp 


Arg 


Cys 


Leu 


Tyr 


His 


Asn 


Thr 


Lys 


Leu Asp 


Asn 


Asn 


Leu 










170 










175 








180 


Asp 


Ser 


Val 


Ser 


Leu 


Leu 


Ile


Asp 


His 


Val 


Cys Ser 


Arg 


Arg 


Gln










185 










190 








195 


Phe 


Pro 


Cys 


Tyr 


Thr 


Gln


Gln


Ile


Leu 


Thr 


Glu His 


Cys 


Asn 


Glu 










200 










205 








210 


Val 


Trp 


Phe 


Cys 


Lys 


Phe 


Ser 


Asn 


Asp Gly 


Thr Lys 


Leu 


Ala 


Thr 










215 










220 








225 


Gly 


Ser 


Lys 


Asp 


Thr 


Thr 


Val 


Ile


Ile


Trp 


Gln Val


Asp 


Pro 


Asp 










230 










235 








240 


Thr 


His 


Leu 


Leu 


Lys 


Leu 


Leu 


Lys 


Thr 


Leu 


Glu Gly 


His 


Ala 


Tyr 










245 










250 








255 


Gly 


Val 


Ser 


Tyr 


Ile


Ala 


Trp 


Ser 


Pro Asp 


Asp Asn 


Tyr 


Leu 


Val 










260 










265 








270 


Ala 


Cys 


Gly 


Pro 


Asp 


Asp 


Cys 


Ser 


Glu 


Leu 


Trp Leu 


Trp 


Asn 


Val 










275 










280 








285 


Gln


Thr 


Gly 


Glu 


Leu 


Arg 


Thr 


Lys 


Met 


Ser 


Gln Ser


His 


Glu 


Asp 










290 










295 








300 


Ser 


Leu 


Thr 


Ser 


Val 


Ala 


Trp 


Asn 


Pro Asp 


Gly Lys 


Arg 


Phe 


Val 










305 










310 








315 


Thr 


Gly 


Gly 


Gln


Arg 


Gly 


Gln


Phe 


Tyr 


Gln


Cys Asp 


Leu 


Asp 


Gly 



320 325 330
Asn Leu


Leu 


Asp 


Ser 


Trp 


Glu 


Gly Val Arg 


Val Gln


Cys 


Leu 


Trp 








335 






340 








345 


Cys Leu 


Ser 


Asp 


Gly 


Lys 


Thr 


Val Leu Ala 


Ser Asp 


Thr 


His 


Gln








350 






355 








360 


Arg Ile


Arg 


Gly 


Tyr 


Asn 


Phe 


Glu Asp Leu 


Thr Asp 


Arg 


Asn 


Ile








365 






370 








375 


Val Gln


Glu 


Asp 


His 


Pro 


Ile


Met Ser Phe 


Thr Ile


Ser 


Lys 


Asn 








380 






385 








390 


Gly Arg 


Leu 


Ala 


Leu 


Leu 


Asn 


Val Ala Thr 


Gln Gly Val


His 


Leu 








395 






400 








405 


Trp Asp 


Leu 


Gln


Asp 


Arg 


Val 


Leu Val Arg 


Lys Tyr 


Gln


Gly 


Val 








410 






415 








420 


Thr Gln


Gly 


Phe 


Tyr


Thr 


Ile


His Ser Cys 


Phe Gly Gly 


His 


Asn 








425 






430 








435 


Glu Asp 


Phe 


Ile


Ala 


Ser 


Gly 


Ser Glu Asp 


His Lys 


Val 


Tyr 


Ile








440 






445 








450 


Trp His 


Lys


Arg


Ser 


Glu 


Leu 


Pro Ile Ala


f~l 111 T ai i 


i. ILL 


Gly


His 








455 






460 








465 


Thr Arg 


Thr 


Val 


Asn 


Cys 


Val 


Ser Trp Asn 


- 

Pro Gln


Tip 


Pro 


Ser 








470 






475 








480 


Met Met 


Ala 


Ser 


Ala 


Ser 


Asp 


Asn Gly Thr


val Ax y 


Tip 


Trp


Gly








485 






490 








495 


Pro Ala 


Pro 


Phe 


Ile


Asp 


His 


Gln Asn Ile


r*l ii Pin 


uiU 


Cys


Ser 








500 






505 








510 


Ser Met 


Asp 


Ser
<223> Incyte ID
<400> 44




















Met Glu Glu Trp 


Asp 


Val 


Pro 


Gln Met Lys


T x re* /*5 1 n 

uys uiU 


Val 


Glu 


Ser 


1 






5 






10 








15 


Leu Lys 


Tyr 


Gln


Leu 


Ala 


Phe Gln Arg Glu


"Mot" 2V 1 a 
ncL Aid 


Ser 


Lys


Thr 








20 






25 








30 


Ile Pro


Glu 


Leu 


Leu 


Lys 


Trp 


Ile Glu Asp


Gly Ile


Pro 


Lys 


Asp 








35 






40 








45 


Pro Phe 


Leu 


Asn 


Pro 


Asp 


Leu Met Lys Asn 


Asn Pro 


Trp 


Val 


Glu 








50 






55 








60 


Lys Gly Lys Cys 


Thr 


Ile


Leu 


















65 
















<210> 45 




















<211> 315 




















<212> PRT 




















<213> Homo sapiens 
















<220> 






















<221> misc_feature
















<223> Incyte ID No: 760677CD1










<400> 45 




















Met Ala 


Phe 


Pro 


Glu 


Pro 


Lys 


Pro Arg Pro 


Pro Glu 


Leu 


Pro 


Gln


1 






5 






10 








15 


Lys Arg Leu Lys 


Thr 


Leu Asp Cys Gly Gln


Gly Ala Val 


Arg 


Ala 








20 






25 








30 


Val Arg 


Phe 


Asn Val 


Asp Gly Asn Tyr Cys 


Leu Thr 


Cys 


Gly 


Ser 








35 






40 








45 


Asp Lys 


Thr Leu Lys 


Leu 


Trp Asn Pro Leu 


Arg Gly Thr 


Leu 


Leu
50








55 




60 


Arg 


Thr 


Tyr Ser 


Gly 


His 


Gly 


Tyr 


Glu Val 


Leu Asp Ala Ala 


Gly 








65 








70 




75 


Ser 


Phe 


Asp Asn 


Ser 


Ser 


Leu 


Cys 


Ser Gly 


Gly Gly Asp Lys 


Ala 








80 








85 




90 


Val 


Val 


Leu Trp 


Asn 


Val 


Ala 


Ser 


Gly Gln


Val Val Arg Lys 


Phe 








95 








100 




105 


Arg 


Gly 


His Ala 


Gly 


Lys 


Val 


Asn 


Thr Val 


Gln Phe Ser Glu


Glu 








110 








115 




120 


Ala 


Thr 


Val Ile


Leu 


Ser 


Gly 


Ser 


Ile Asp


Ser Ser Ile Arg


Cys 








125 








130 




135 


Trp 


Asp 


Cys Arg 


Ser 


Arg 


Arg 


Pro 


Glu Pro 


Val Gln Thr Leu


Asp 








140 








145 




150 


Glu 


Ala 


Arg Asp 


Gly 


Val 


Ser 


Ser 


Val Lys 


Val Ser Asp His 


Glu 








155 








160 




165 


Ile


Leu 


Ala Gly 


Ser 


Val 


Asp 


Gly 


Arg Val 


Arg Arg Tyr Asp 


Leu 








170 








175 




180 


Arg 


Met 


Gly Gln


Leu 


Phe 


Ser 


Asp 


Tyr Val 


Gly Ser Pro Ile


Thr 








185 








190 




195 


Cys 


Thr 


Cys Phe 


Ser 


Arg 


Asp 


Gly 


Gln Cys


Thr Leu Val Ser 


Ser 








200 








205 




210 


Leu 


Asp 


Ser Thr 


Leu 


Arg 


Leu 


Leu 


Asp Lys 


Asp Thr Gly Glu 


Leu 








215 








220 




225 


Leu 


Gly 


Glu Tyr 


Lys 


Gly 


His 


Lys 


Asn Gln


Glu Tyr Lys Leu 


Asp 








230 








235 




240 


Cys 


Cys 


Leu Ser 


Glu 


Arg 


Asp 


Thr 


His Val 


Val Ser Cys Ser 


Glu 








245 








250 




255 


Asp 


Gly 


Lys Val 


Phe 


Phe 


Trp 


Asp 


Leu Val 


Glu Gly Ala Leu 


Ala 








260 








265 




270 


Leu 


Ala 


Leu Pro 


Val 


Gly 


Ser 


Gly 


Val Val 


Gln Ser Leu Asp


Tyr 








275 








280 




285 


His 


Pro 


Thr Glu 


Pro 


Cys 


Leu 


Leu 


Thr Ala 


Met Gly Gly Ser 


Val 








290 








295 




300 


Gln


Cys 


Trp Arg 


Glu 


Glu 


Ala 


Tyr 


Glu Ala 


Glu Asp Gly Ala 


Gly 



305 310 315 

<210> 46 

<211> 504 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1348567CD1

<400> 46 



Met 


Ser 


Leu 


Ile


Cys 


Ser 


Ile


Ser 


Asn 


Glu 


Val 


Pro 


Glu 


His 


Pro 


1 








5 










10 










15 


Cys 


Val 


Ser 


Pro 


Val 


Ser 


Asn 


His 


Val 


Tyr 


Glu Arg Arg Leu 


Ile










20 










25 










30 


Glu 


Lys 


Tyr 


Ile


Ala 


Glu 


Asn 


Gly 


Thr 


Asp 


Pro 


Ile


Asn 


Asn 


Gln










35 










40 










45 


Pro 


Leu 


Ser 


Glu 


Glu 


Gln


Leu 


Ile


Asp 


Ile


Lys 


Val 


Ala 


His 


Pro 










50 










55 










60 


Ile


Arg 


Pro 


Lys 


Pro 


Pro 


Ser 


Ala 


Thr 


Ser 


He 


Pro 


Ala 


He 


Leu 










65 










70 










75 


Lys 


Ala 


Leu 


Gln


Asp 


Glu 


Trp 


Asp 


Ala 


Val 


Met 


Pro 


His 


Ser 


Phe 










80 










85 










90 


Thr 


Leu 


Arg 


Gin 


Gin 


Leu 


Gin 


Thr 


Thr Arg 


Gin 


Glu 


Leu 


Ser 


His 










95 










100 










105 


Ala 


Leu 


Tyr 


Gin 


His 


Asp 


Ala 


Ala 


Cys 


Arg 


Val 


Ile


Ala 


Arg 


Leu 










110 










115 










120 


Thr 


Lys 


Glu 


Val 


Thr 


Ala 


Ala 


Arg 


Glu 


Ala 


Leu 


Ala 


Thr 


Leu 


Lys 
125








Pro 


Gln


Ala Gly 


Leu 


Ile


Val


Pro 










140 








Pro 


Ser 


Val 


Val 


Gly 


Ala 


Gly 


Glu 










155 








Val 


Gly 


Met 


Thr 


Pro 


Glu 


He 


He 










170 








Thr 


Val 


Leu 


Thr 


Thr 


Glu 


Arg 


Lys 










185 








Glu 


Glu 


Leu 


Val 


Lys 


Pro 


Glu 


Glu 










200 








Ala 


Ser 


His 


Val 


Gly 


Leu 


His 


Ser 










215 








Ala 


Leu 


Asp 


Leu 


Cys 


Pro 


Ser 


Asp 










230 








Gly 


Ala 


Asp 


Lys 


Asn 


Val 


Val 


Val 










245 








Ile


Leu 


Ala 


Thr 


Leu 


Lys 


Gly 


His 










260 








Val 


Phe 


His 


Pro 


Ser 


Gin 


Asp 


Leu 










275 








Ala 


Thr 


Ile Arg


Ile


Trp 


Ser 


Val 










290 








Val 


Val 


Arg Ala 


His 


Glu 


Ser 


Ala 










305 








Ala 


Thr 


Gly Asp 


Tyr 


Leu 


Leu 


Ser 










320 








Ala 


Phe 


Ser 


Asp 


He 


Gin 


Thr 


Gly 










335 








Asp


Glu 


Thr 


Ser 


Gly 


Cys


Ser 


Leu 










350 








Asp 


Gly 


Leu 


He 


Phe 


Gly 


Thr 


Gly 










365 








He 


Trp 


Asp Leu 


Lys 


Glu 


Arg 


Thr 










380 








His 


Ser 


Gly Pro 


He 


Thr 


Ser 


He 










395 








Tyr 


Leu 


Ala 


Thr 


Ala 


Ala 


Asp 


Asp 










410 








Leu 


Arg


Lys 


Leu 


Lys


Asn 


Phe 


Lys










425 








Phe 


Glu 


Val 


Lys 


Ser 


Leu 


He 


Phe 










440 








Ala 


Leu 


Gly Gly 


Thr 


Asp 


Val 


Gin 










455 








Thr 


Glu 


He 


Leu 


His 


Phe 


Thr 


Glu 










470 








Val 


Ala 


Phe Gly 


His 


His 


Ala 


Lys 










485 








Asp 


Arg 


Ser 


Leu 


Lys 


Phe 


Tyr 


Ser 










500 









<210> 47 

<211> 522 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1751354CD1 

<400> 47 

Met Ala Phe Leu Asp Asn Pro Thr 
1 5 





130 








135 


Gin 


Ala 


Val 


Pro 


Ser Ser 


Gin 




145 








150 


Pro 


Met 


Asp 


Leu 


Gly Glu 


Leu 




160 








165 


Gin 


Lys 


Leu 


Gin 


Asp Lys 


Ala 




175 








180 


Lys 


Arg 


Gly 


Lys 


Thr Val 


Pro 




190 








195 


Leu 


Ser 


Lys 


Tyr 


Arg Gin 


Val 




205 








210 


Ala 


Ser 


Ile


Pro 


Gly Ile


Leu 




220 








225 


Thr 


Asn 


Lys 


He 


Leu Thr 


Gly 




235 








240 


Phe 


Asp 


Lys 


Ser 


Ser Glu 


Gin 




250 








255 


Thr 


Lys 


Lys 


Val 


Thr Ser 


Val 




265 








270 


Val 


Phe 


Ser 


Ala 


Ser Pro 


Asp 




280 








285 


Pro 


Asn 


Ala 


Ser 


Cys Val 


Gin 




295 








300 


Val 


Thr 


Gly 


Leu 


Ser Leu 


His 




310 








315 


Ser 


Ser 


Asp 


Aso 


Gin Tyr 


Trp 




325 








330 


Aro 


Val 


Leu 


Thr 


Lys Val 


Thr 




340 








345 


Thr 


Cys


Ala 


Gin 


Phe His 


Pro 




355 








360 


Thr 


Met 


Asp 


Ser 


Gln Ile


Lys 




370 








375 


Asn 


Val 


Ala 


Asn 


Phe Pro 


Gly 




385 








390 


Ala 


Phe 


Ser 


Glu 


Asn Gly 


Tyr 




400 








405 


Ser 


Ser 


Val 


Lys 


Leu Trp 


Asp 




415 








420 


Thr 


Leu 


Gin 


Leu 


Asp Asn 


Asn 




430 








435 


Asp 


Gin 


Ser 


Gly


Thr Tyr 


Leu 




445 








450 


He 


Tyr 


He 


Cys 


Lys Gin 


Trp 




460 








465 


His 


Ser 


Gly 


Leu 


Thr Thr 


Gly 




475 








480 


Phe 


He 


Ala 


Ser 


Thr Gly 


Met 




490 








495 


Leu 












He 


He 


Leu 


Ala 


His Ile


Arg 




10 








15 
Gln Ser


His 


Val 


Thr 


Ser Asp Asp 


Thr 


Gly


Met 


Cys


Glu 


Met 


Val 








20 






25 










30 


Leu Ile


Asp 


His 


Asp


Val Asp Leu 


Glu 


Lys 


Ile


His 


Pro 


Pro 


Ser 








35 






40 










45 


Met Pro 


Gly Asp 


Ser 


Gly Ser Glu 


Ile


Gln


Gly 


Ser 


Asn 


Gly 


Glu 








50 






55 










60 


Thr Gln


Gly Tyr 


Val 


Tyr Ala Gin 


Ser 


Val 


Asp 


Ile


Thr 


Ser 


Ser 








65 






70 










75 


Trp Asp 


Phe Gly 


Ile


Arg Arg Arg


Ser 


Asn 


Thr 


Ala 


Gln


Arg 


Leu 








80 






85 










90 


uxu c\±. y 


Leu Arg 


Lys 


Glu Ara Gin 


Asn 


Gin 


Ile


Lys


Cys


Lys


Asn 






95 






100 










105 


Tip Gin 

lie VJlll 


Trp 


Lys 


Glu 


Arg Asn Ser

ai y noli o ^ j. 


Lys 


Gin 


Ser 


Ala 


Gin 


Glu 


Leu 








110 






115 










120 


Lys Ser 


Leu 


Phe


Glu


Lys Lys Ser


Leu


Lys


Glu


Lys


Pro 


Pro 


Ile






125 






130 










135 


Cpr- (TIT \>r 


Lys 


Gin 


Ser 


Tie Leu Ser 


Val 


Arg


Leu 


Glu 


Gln


Cys

jr *J 


Pro 








140 






145 










150 


T.en 1 Tt 

JjC Li. Ulli 


Leu 


Asn 


Asn 


Pro Phe Asn 


Glu 


Tyr


Ser


Lys


Phe


Asp


Gly








155 






160 










165 


Lys Gly 


His 


Val 


Gly


Thr Thr Ala


Thr


Lys


Lys


Ile


Asp 


Val 


Tyr 








170 






175 










180 


Lieu rx.u 


Leu 


His 


Cpr 


Ser Gin Asti 

Jvl will rtO^ 


Arg


Leu 


Leu 


Pro 


Met 


Thr 


Val 








185 






190 










195 


VQl 1 111 


Met 


Ala 


Ser 


Ala Ara Val 

nlu *»i y veil 


Gin 


Asp 


Leu 


Ile


Gly


Leu


Ile








aw 






205 










210 


Cys Trp 


Gin 


Tyr 


Thr 


Ser Glu Gly

kP^X UXU W4.JT 


Arcr 

«i y 


Glu 


Pro 


Lys 


Leu 


Asn 


Asd 








215 






220 










225 


AVOll VQX 


Ser 


Ala 




Cys Leu His


Ile


Ala 


Glu 


Asp 


Asp 


Gly 


Glu 








£* J \J 






235 










240 


Va 1 Asd 


Thr 


Asp 


Phe 


Pro Pro Leu 


Asp 


Ser 


Asn 


Glu 


Pro 


lie 


His 








245 






250 










255 


Lys Phe 


Gly Phe 


Ser 


Thr Leu Ala 


Leu 


Val 


Glu 


Lys


Tyr


Ser 


Ser 






£* \J V 






265 










270 


Ir X U \Jl 


Leu 


Thr 


Ser 


L.vs Gl u Ser 


Leu 


Phe 


Val 


Ara 


He 


Asn 


Ala 














280 










285 


Ala Ui c 

Ala tllS 


Gly Phe 


Cor 
oci 


T on Tl o f2l ti 

liCU 11C V3111 


vai 


A C5T> 


Asn 


Thr 


Lys 


Val 


Thr 








£ y u 






295 










300 


Mot- T \/c 


Glu 


He 


lit; LI 


T.oii T.vrc Hi a 


Val 


Lys 


A m 
riiy 


Ara 


Lys 


Glv 

oiy 


Ser 








J U J 






310 










315 


n T,\/e 


Val 


Ser 




riu oiu i _y i 


Aig 


Leu 


Glu 


Lys 


Gin 


Ser 


Glu 














325 










330 


n vj noil 


Val 


Ala 


Val 


Asn Leu Asrj 


Ser 


Thr 


Leu 


Glu 


Ser 


Gin 


Ser 








335 






340 










345 


Ala. Trp 


Glu 


Phe 


Cvs 


Leu Val Ara 


Glu 


Asn 


Ser 


Ser 


Arg 


Ala 


Asd 








350 






355 










360 


Glv Val 

V?XV VU-aL 


Phe 


Glu 


Glu 


Asp Ser Gin 


He 


Asp 


He 


Ala 


Thr 


Val 


Gin 








365 






370 










375 


21 en Met 


Leu 


Ser 


Ser 


His His Tyr 


Lys 


Ser 


Phe 


Lys 


Val 


Ser 


Met 








380 






385 










390 


lie His 


Arg 


Leu 


Arcr 


Phe Thr Thr 


Asd 


Val 


Gin 


Leu 


Gly 


lie 


Ser 








395 






400 










405 


Gly Asp 


Lys 


val 


Glu 


He Asp Pro 


Val 


Thr 


Asn 


Gin 


Lys 


Ala 


Ser 








410 






415 










420 


Thr Lys 


Phe 


Trp 


He 


Lys Gin Lys 


Pro 


He 


Ser 


He 


Asp 


Ser 


Asp 








425 






430 










435 


Leu Leu 


Cys Ala 


Cys 


Asp Leu Ala 


Glu 


Glu 


Lys 


Ser 


Pro 


Ser 


His 








440 






445 










450 


Ala lie 


Phe Lys 


Leu 


Thr Tyr Leu 


Ser 


Asn 


His 


Asp 


Tyr 


Lys 


His 








455 






460 










465 


Leu Tyr 


Phe 


Glu 


Ser 


Asp Ala Ala 


Thr 


Val 


Asn 


Glu 


He 


Val 


Leu 








470 






475 










480 


Lys Val 


Asn Tyr 


He 


Leu Glu Ser 


Arg 


Ala 


Ser 


Thr 


Ala 


Arg 


Ala 
485

Asp Tyr Phe Ala Gln Lys Gln Arg Lys
500 

Phe Ser Phe Gln Lys Glu Lys Lys Ser
515 

<210> 48 
<211> 316 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1976780CD1 

<400> 48 



Met 


Ala 


Ser 


Lys 


Asp Lys 


Ser 


Ser Lys


T.vre 


Asn Val 


Phe 


Glu 


Leu 


1 








5 






J. u 








1 c 


Lys 


Thr 


Arg 


Gin 


Gly Thr Glu 


LcU Lieu 


lie 


Gin Ser 


Asp 


Asn 


Asp 










20 
















Thr 


Val 


He 


Asn 


Asp Trp 


Phe 


LyS Vdi 


Leu 


Ser Ser 


Thr 


lie 


Asn 










35 














*± j 


Asn Gin 


Ala 


Val 


Glu Thr Asp 




Lie 


Glu Glu 


Glu 


lie 












50 






DD 








Du 


Asp 


Ser 


Pro Gly 


He Glu 


Lys 


HIS ASp 


Lys 


Glu Lys 


Glu 


Gin 


Lys 










65 






/ u 








/ D 


Asp 


Pro 


Lys 


Lys 


Leu Arg 


Ser 


rxie L»ys 


val 


Ser Ser 


lie 


Asp 


Car 










80 














-7 V 


Ser 


Glu 


Gin 


Lys 


Lys Thr 


Lys 


Lys Asn 


Leu 


Lys Lys 


Phe 


Leu 


1 nr 










95 






1 u u 








1 n R 

X U J 


Arg Arg 


Pro 


Thr 


Leu Gin 


Ala 


vax Arg 


vjiu 


Lys Gly Tyr 


lie 


Lys 










110 






1 1 c 
113 








1 z u 


Asp Gin 


Val 


Phe 


Gly Ser Asn 


Leu Ala 


Asn 


Leu Cys Gin Arg 


olU 










125 






1 "3 A 

XO U 








1 3D 


Asn Gly 


Thr 


Val 


Pro Lys 


Phe 


Val LryS 


Leu 


Cys lie 


Glu 


His 


Val 










140 






1 

iy j 








1 


Glu 


Glu 


His Gly 


Leu Asp 


lie 


Asp oiy 


lie 


Tyr Arg Val 


Ser 


f2l v 
vjiy 










155 






160 








165 


Asn 


Leu 


Ala 


Val 


Ile Gln


Lys 


Leu Arg 


Phe 


Ala Val 


Asn 


His 


Asp 










170 






175 








180 


Glu 


Lys 


Leu 


Asp 


Leu Asn Asp 


Ser Lys 


Trp 


Glu Asp 


Ile


His 


Val 










185 






190 








195 


Ile


Thr 


Gly Ala 


Leu Lys 


Met 


Phe Phe 


Arg 


Glu Leu 


Pro 


Glu 


Pro 










200 






205 








210 


Leu 


Phe 


Thr 


Phe 


Asn His 


Phe 


Asn Asp 


Phe 


Val Asn 


Ala 


Ile


Lys 










215 






220 








225 


Gln


Glu 


Pro Arg 


Gln Arg Val


Ala Ala 


Val 


Lys Asp Leu 


Ile


Arg 










230 






235 








240 


Gln


Leu 


Pro 


Lys 


Pro Asn 


Gln


Asp Thr 


Met 


Gln Ile


Leu 


Phe 


Arg 










245 






250 








255 


His 


Leu 


Arg Arg 


Val Ile


Glu 


Asn Gly 


Glu 


Lys Asn Arg Met 


Thr 










260 






265 








270 


Tyr 


Gln


Ser


Ile


Ala Ile


Val 


Phe Gly 


Pro 


Thr Leu 


Leu 


Lys 


Pro 










275 






280 








285 


Glu 


Lys 


Glu 


Thr 


Gly Asn 


Ile


Ala Val 


His 


Thr Val 


Tyr Gln


Asn 










290 






295 








300 


Gln


Ile


Val 


Glu 


Leu Ile


Leu 


Leu Glu 


Leu 


Ser Ser 


Ile


Phe 


Gly 










305 






310 








315 



Arg 

<210> 49 

<211> 387 

<212> PRT 

<213> Homo sapiens 



490 495 
Leu Asn Arg Arg Thr Ser 
505 510 
Gly Gln Gln
520 
<220>

<221> misc_feature 

<223> Incyte ID No: 2048234CD1 

<400> 49 

Met Val His Cys Ser Cys Val Leu 

1 5 
Ile Asp Lys Leu Arg Leu Phe Thr
20 

Gly Tyr Pro Arg Leu Gly Gly Glu 
35 

Trp Val Val Ala Gln Asn Arg Met
50 

Arg Tyr Pro Arg Lys Arg Phe Val 
65 

Lys Ile Ser Ala Leu Lys Gly Ser
80

Pro Val Pro Val Gly Ile Ser Val
95

Ile Gly Glu Leu Ser Lys Glu Asn
110 

Gly Gly Leu Gly Gly Lys Leu Leu 
125 

Gly Gln Lys Arg Ile Ile His Leu
140

Val Gly Leu Val Gly Phe Pro Asn

Ser Cys Val Ser His Ala Lys Pro 
170 

Thr Thr Leu Lys Leu Lys Leu Gly 
185 

Lys Gln Ile Ser Val Ala Asp Leu
200 

His Met Asn Lys Gly Met Gly His 
215 

Arg Thr Arg Gln Leu Leu Phe Val
230

Leu Ser Ser His Thr Gln Tyr Arg
245 

Leu Leu Thr Lys Glu Leu Glu Leu 
260 

Lys Pro Ala Leu Leu Ala Val Asn 
275 

Gln Asp Lys Phe His Glu Leu Met
290 

Asp Phe Leu His Leu Phe Glu Lys 
305 

Val Glu Phe Gln His Ile Ile Pro
320

Gly Ile Glu Glu Leu Lys Asn Cys
335

Gln Ala Asn Gln Glu Asn Asp Ala
350

Asn Leu Trp Ile Ser Asp Thr Met
365 

Lys His Ala Val Thr Thr Ser Lys 
380 

<210> 50 

<211> 334 

<212> PRT 

<213> Homo sapiens 

<220> 



Phe 


Ara 


Lys 


Tyr 


Gly 


Asn 


Phe 




10 










15 


Arcr 


Glv 


Gly Ser 


Glv 


Gly 


Met 




25 










30 


Glv 


Glv 


Lys Gly 


Glv 


Asp 


Val 




40 










45 


Thr 


Leu 


Lys 


Gin 


Leu 


LVS 


Asp 




55 










60 


Ala 


Glv 


Val 


Gly 


Ala 


Asn 


Ser 




70 










75 


Lys 


Glv 


Lys 


Asp 


*■ Xf 


Glu 


He 




85 










90 




Asp 


Glu 


Asn 


Glv 
oxy 


Lys 


He 




100 










105 


Asp 


axq 


He 


Leu 


Val 


Ala 


Gin 




115 

X X -J 










120 


Thr 


& en 
noil 


Phe 


Leu 


Pro 


Leu 


Lys 




110 

X J \J 










135 


Asp 


Leu 


Lys 


Leu 


11c 




Ad ky 




i 45 

X*± -J 










150 


Ala 


Gl \r 


Lys 


Ser 


Car 
OCX 




Leu 




1DU 










1 65 

X o -> 


Ala 


lie 


Ala Asp 


iyr 


Ala 






1 75 
x / _> 










180 


Lys 


lie 


Met 


Tyr 


Cot* 

OCX 




Pho 
rue 




1 90 

X J V 










195 

1- Z> -J 


Pro 


<oiy 


Leu 


He 


Gl ii 


Gl vr 


Ala 




9 05 










910 
£ x u 


Lys 


.rile 


Leu 


Lys 


Hi Q 
nxo 


Tl o 
lie 


Gl n 

VJX u 




99 0 










225 


val 


Asp 


He 


Ser 


Gl \r 




Gin 




915 










240 


X I AX 


Ala 


Phe 


Glu 


Thr 


He 


He 




250 










255 


ryr 


T .wc 
Ijyo 


Glu 


Glu 


Leu 


Gin 


Thr 




265 










270 




Met 


Asp 


Leu 


Pro 


Asp 


Ala 




980 










285 


Ser 


Gin 


Leu 


Gin 


Asn 


Pro 


Lys 




295 










300 


Asn 


Met 


He 


Pro 


Glu 


Arg 


Thr 




310 










315 


lie 


Ser 


Ala 


Val 


Thr 


Gly 


Glu 




325 










330 


He 


Arg 


Lys 


Ser 


Leu 


Asp 


Glu 




340 










345 


Leu 


His 


Lys 


Lys 


Gin 


Leu 


Leu 




355 










360 


Ser 


Ser 


Thr 


Glu 


Pro 


Pro 


Ser 




370 










375 


Met 


Asp 


He 


He 









385 
<221> misc_feature

<223> Incyte ID No: 2111754CD1 

<400> 50 



Met Pro Ser Gly


Pro 


Ax a 


Ala 


Ala Leu 


Arg 


Trp Ala Ser 


Pro 


Ser 


1 


5 








10 






15 


Gln Leu Val Ser




His 


Val 


Leu Arg 


Asn 


Gly Ile Tyr


Ala 


Cys 




20 








25 






30 


Tyr Pro His Ser 


Leu 


Ara 

nx y 


Pro 


Arg Thr Pro 


Leu Leu Cys Ala 


Ser 










40 






45 


A tci Asn Tie LiVS 


Pro 


Ara 


Ara 


Ser Glu 


Leu 


Leu Gly Cys 


Pro 


Val 




50 








55 






60 


VJ X y \w jr O Al y VJ X y 


Ser 


Leu 


Ser 


Glu Gin Arg 


Ile Cys Leu


Leu 


Gly 












70 






75 


vya uCU val <rix y 


Ala 

Ala 


Ser 


Glu 


Lys Gly Val 


Ser Cys Cys Gln


Leu 




SO 








85 






90 


del val oXy olU 


T on 
Ijtr LI 


v ax 


nx 0 


Val Ser 


Pro 


Leu Arg Ile


Pro 


Thr 


Q5 








100 






105 


u er pl u Acn Ala 
net «iy ftsn Ala 


Car 
OCX 


XT XltS 


vsxy 


Ser Lys 


Glu 


Gln Lys Leu


Leu 


Lys 


inn 

1 1 u 








115 






120 


ni y bcu niy lieu 


T.01 1 


Pro 


Ala 


Leu Leu 


Ile


Leu Arg Ala Phe 


Lys 




1Z J 








130 






135 


riO rixS ATg l«ys 


Ile


Arg 


Asp 


Tyr Arg 


Val 


Val Val Val 


Gly 


Thr 




14U 








145 






150 


Ala Gly Val Gly


Lys 


Cor 

OCX 


J. in 


Leu Leu 


His 


Lys Trp Ala 


Ser 


Gly




133 








160 






X v J 


Asn fne nig nis 




Tyr 


T .01 i 


Pro Thr 


Ile


Glu Asn Thr 


Tyr 


\~y & 




1 / u 








175 






180 


\sj.H JutrLi Jjcu uiy 




Ser 


His 


Gly Val 


Leu 


Ser Leu His 


Ile


Thr 




103 








190 






195 


Asp Ser Lys Ser 


oiy 


AO L; 


uiy 


Asn Arg Ala 


Leu Gln Arg His


Val 




z u u 








205 








lie Aia Arg \3±y 


Ml S 


Ala 
Ala. 


IT 1 1" 


Val Leu 


Val 


Tyr Ser Val 


Thr 


Lys 












220 






225 


ijys ijiu 1 ni lieu 




Pi 1-, 
VjI LI 


T .01 1 
1/t: LI 


Lys Ala 


Phe 


Tyr Glu Leu 


Ile


Cys 




^ J V 








235 






240 


Ltyo 116 Liya Vsiy 


Asn 


Asn 


Ton 


His Lys 


Phe 


Pro Ile Val


Leu 


Val 












250 






255 


Gly Asn Lys Ser 


Asp 


Asp 


Thr 


His Arg Glu 


Val Ala Leu 


Asn 


Asp 




260 








265 






270 


Gly Ala Thr Cys 


Ala 


Met 


Glu 


Trp Asn Cys 


Ala Phe Met 


Glu 


Ile


275 








280 






285 


Ser Ala Lys Thr 


Asp 


Val 


Asn 


Val Gln


Glu 


Leu Phe His 


Met 


Leu 




290 








295 






300 


Leu Asn Tyr Lys 


Lys 


Lys 


Pro 


Thr Thr Gly 


Leu Gln Glu


Pro 


Glu 




305 








310 






315 


Lys Lys Ser Gln


Met 


Pro 


Asn 


Thr Thr 


Glu 


Lys Leu Leu 


Asp 


Lys 




320 








325 






330 



Cys Ile Ile Met

<210> 51 
<211> 551 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 21232S6CD1 

<400> 51 

Met Glu Glu Glu Leu Pro Leu Phe Ser Gly Asp Ser Gly Lys Pro 

1 5 10 15

Val Gln Ala Thr Leu Ser Ser Leu Lys Met Leu Asp Val Gly Lys
Trp Pro Ile
Arg Gln Ala
Thr Thr Val
Gly Cys Leu
Arg Leu Asp
Gly Ser Gly
Phe Thr Trp
Thr Asn His
Asn Lys Gln
Val Leu Thr
Ser Gly Gln
Arg Arg Val
Ala Cys Gly
Val Tyr Val
Asn Ser Gly
Gly Ile Arg
Val Leu Thr
Tyr Gly Gln
Thr Pro Val
Cys His Ser
Val Tyr Met
His Leu Thr
Ala Thr Pro
Asp His Leu
Pro Asp Thr
Tyr Ala His
Ser Ser Leu
Phe Ser Tyr
Asp Ser Ile
Leu Ala Thr
Gln Thr Ile





20 




Phe 


Ser 


Leu 




35 




Cys 


Val 


Phe 




50 




Asn 


Asp 


Glu 




65 




Gly 


Leu 


Gly 




80 




Ser 


Leu 


Asn 




95 




Pro 


His 


He 




110 




Gly 


His 


Asn 




125 




Gly 


Leu 


Val 




140 




Val 


He 


Glu 




155 




Ser 


Asp 


Gly 




170 




Val 


Gly 


Ser 




185 




Thr 


Gly 


Cys 




200 




Gin 


Met 


Cys 




215 




Trp 


Gly 


Tyr 




230 




Asn 


Gin 


Pro 




245 




Val 


Gin 


Arg 




260 




Asp 


Glu 


Gly 




275 




Leu 


Gly 


Thr 




290 




Thr 


Val 


Glu 




305 




Thr 


His 


Thr 




320 




Trp 


Gly 


Gin 




335 




His 


Phe 


Ser 




350 




Ala 


Val 


Thr 




365 




Thr 


Val 


Ala 




380 




Ala 


Asp 


Leu 




395 




Lys 


Val 


Leu 




410 




Glu 


Asp 


Asn 




425 




Pro 


Val 


Tyr 




440 




Ser 


Leu 


Ser 




455 




Phe 


Tyr 


Arg 




470 




Lys 


Gin 


Gly 




485 





cys 


t>er 


(jlU 


Giy 


Cor 


Ala 


lie 


pne 


vai 


Asp 


vai 


Gin 


Gly 


Lys 


Lys 


Val 


Leu 


Ala 


Ala 


Tyr 


Ser 


Pro 


Cys 


His 


Val 


Ala 


Cys 


Glu 


Val 


Phe 


Gly 


Ser 


Thr 


Leu 


Gin 


Asn 


Cys 


Met 


Ala 


Asn 


Gly 


Asn 


Thr 


Pro 


Cys 


Val 


Ala 


Cys 


Gin 


Val 


Tyr 


Gly 


Asn 


Lys 


Lys 


Asp 


Arg 


Ser 


Ala 


Ala 


Cys 


Arg 


Gly 


Cys 


Thr 


Asp 


Trp 


Arg 


Leu 


Glu 


Ser 


Leu 


Lys 


Pne 


Leu 


Leu 


Lys 


He 


Glu 


Asp 


Asp 


Arg 


Ala 


Phe 


Pro 


Glu 


Glu 


Glu 


Asn 


Arg 


He 


Cys 


Glu 



25 






Glu 


Glu 


Leu 


40 






Gly 


Asn 


Glu 


55 






Leu 


Gly 


Thr 


70 






Ser 


Thr 


lie 


85 






He 


Ala 


Cys 


100 






Thr 


Thr 


Glu 


115 






Gin 


Leu 


Gly 


130 






He 


Ser 


Thr 


145 






Gly 


Ser 


Tyr 


160 






Ala 


Trp 


Gly 


175 






Val 


Asn 


Gin 


190 






Lys 


Val 


Val 


205 






Val 


Val 


Asp 


220 






Gly 


Gin 


Leu 


235 






Arg 


Val 


Ala 


250 






Gly 


Tyr 


Ala 


265 






Ala 


Trp 


Gly 


280 






Ser 


Asn 


Gin 


295 






lie 


He 


Glu 


310 






Lys 


Thr 


Gin 


325 






Gin 


Ser 


Val 


340 






Asp 


Val 


Phe 


355 






Leu 


Ser 


Val 


370 






Lys 


Arg 


Glu 


385 






Val 


Asp 


Gly 


400 






Arg 


Cys 


Glu 


415 






He 


Val 


Glu 


430 






Leu 


Glu 


Tyr 


445 






Ala 


Val 


Gly 


460 






Leu 


Lys 


Lys 


475 






Glu 


Asn 


Ala 


490 











30 


Gin 


Leu 


lie 






45 


Val 


Leu 


Tyr 






60 


Asn 


Cys 


Cys 






75 


Glu 


Pro 


Arg 






90 


Leu 


Ser 


Tyr 






105 


Gly 


Glu 


Val 






120 


Asn 


Gly 


Thr 






135 


Asn 


Leu 


Ser 






150 


His 


Ser 


Leu 






165 


Tyr 


Asn 


Asn 






180 


Pro 


He 


Pro 






195 


Val 


Thr 


lie 






210 


Thr 


Gly 


Glu 






225 


Gly 


Leu 


Gly 






240 


Ala 


Leu 


Gin 






255 


His 


Thr 


Leu 






270 


Ala 


Asn 


Ser 






285 


Ser 


Tyr 


Pro 






300 


He 


Ala 


Ala 






315 


Gly 


Gly 


His 






330 


He 


Leu 


Pro 






345 


Ala 


Cys 


Phe 






360 


Glu 


Pro 


Asp 






375 


Phe 


Asp 


Asn 






390 


Lys 


Tyr 


lie 






405 


His 


Phe 


Arg 






420 


Met 


Ser 


Glu 






435 


Leu 


Tyr 


Thr 






450 


Leu 


Leu 


Asp 






465 


Leu 


Cys 


Gin 






480 


lie 


Ala 


Leu 






495 
Leu Ser


Ala 


Ala Val 


Lys 


Tyr Asp Ala 






500 






Cys Phe 


Arg 


Phe Cys 


He 


Asn His Leu 






515 






Ser Gly 


Phe 


Ala Glu 


Met 


Asp His Asp 






530 






Ser Lys 


Ala 


Ser Arg 


Val 


Gly Ala Phe 






545 







<210> 52 

<211> 308 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2477507CD1 

<400> 52 



Met 


Ile


His 


Asp 


Ala 


Gln


Met


Asp Tyr










5 








Thr 




Ser 


Ser 


Asp 




Ser 


Val Lys 










20 








uiy 


ri~\ vr 
uiy 


Gin 


He 


Leu 


Tip 


Ala 


Ion T.Alt 
Asp iiCU 










j -j 








Val 


1 J - V 


Gin 


Val 


Ala 


Tj~rt 

*■ ** tr 


Ala 


His Pro 










so 








Ala 


Ser 


Cys 


Ser 




Asp 


Arg 












65 








Asn 


Gly 


Thr 


Trp 


Glu 


Lys 


Ser 


His Glu 










80 








Ser 


Val 


Asn 


Ser 


Val 


Cys


Trn 


Ala Pro 










95 








Leu 


Ala 


Cys 


Gly 


Ser 


Ser 


Aso 


Gly Ala 










110 








Thr 


Gly


Glu 


Gly 


Gln




Glu 


Val Lys 










125 








Thr 


Ile


Gly Cys 


Asn 


Ala 


Val 


Ser Trp 










140 








Gly 


Ser 


Leu 


Ile


Asp 


His 


Pro 


Ser Gly 










155 








Lys 


Arg 


Phe 


Ala 


Ser 


Gly 


Gly 


Cys Asp 










170 








Lys 


Glu 


Glu 


Glu 


Asp 


Gly 


Gln


Trp Lys 










185 








Ala 


His 


Ser Asp 


Trp 


Val 


Arg 


Asp Val 










200 








Gly 


Leu 


Pro 


Thr 


Ser 


Thr 


Ile


Ala Ser 










215 








Val 


Phe 


Ile


Trp 


Thr 


Cys 


Asp 


Asp Ala 










230 








Pro 


Lys 


Leu 


Leu 


His 


Lys 


Phe 


Asn Asp 










245 








Trp 


Ser 


Ile


Thr 


Ala 


Asn 


Ile


Leu Ala 










260 








Lys 


Val 


Thr 


Leu 


Trp 


Lys 


Glu 


Ser Val 










275 








Ile


Ser 


Asp Val 


Asn 


Lys 


Gly 


Gln Gly










290 








Thr 


Glu 


Gly Gln


Gln


Asn 


Glu 


Gln










305 









<210> 53 
<211> 949 
<212> PRT 
Gln


Asp Leu Glu Glu 


Phe 


505 




510 


Thr 


Val Val Thr Gln


Thr 


520 




525 


Leu 


Leu Lys Asn Phe 


Ile


535 




540 


Lys 


Asn 




550 






Tyr 


Gly Thr Arg Leu 


Ala 


10 




15 


Ile


Phe Asp Val Arg 


Asn 


25 




30 


Arg 


Gly His Glu Gly 


Pro 


40 




45 


Met 


Tyr Gly Asn Ile


Leu 


55 




60 


Ile


Ile Trp Arg Glu


Glu 


70 




75 


His 


Ala Gly His Asp 


Ser 


85 




90 


His 


Asp Tyr Gly Leu 


Ile


100 




105 


Ile


Ser Leu Leu Thr 


Tyr 


115 




120 


Lys 


Ile Asn Asn Ala


His 


130 




135 


Ala 


Pro Ala Val Val 


Pro 


145 




150 


Gln


Lys Pro Asn Tyr


Ile


160 




165 


Asn 


Leu Ile Lys Leu


Trp 


175 




180 


Glu 


Glu Gln Lys Leu


Glu 


190 




195 


Ala 


Trp Ala Pro Ser 


Ile


205 




210 


Cys 


Ser Gln Asp Gly


Arg 


220 




225 


Ser 


Ser Asn Thr Trp 


Ser 


235 




240 


Val 


Val Trp His Val 


Ser 


250 




255 


Val 


Ser Gly Gly Asp 


Asn 


265 




270 


Asp 


Gly Gln Trp Val


Cys 


280 




285 


Ser 


Val Ser Ala Ser 


Val 


295 




300 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 2759119CD1 

<400> 53 

Met Asp Ala Leu Glu Asp Tyr Val Trp 

1 5 
Leu Ile Leu Leu Pro Val Thr Gly Leu
20 

Leu Leu Ala Gly Glu Gly Pro Asp Val 
35 

Phe Gly Gly His Leu Arg Met Ile Lys
50

Gly His Tyr Leu Ile His Gly Phe Arg
65 

Gly Asp Leu Asp Leu Glu Ala Met Val 
80 

Gly Leu Arg Val Val Lys Ile Ser Trp
95 

Glu Leu Trp Arg Ser Gly Leu Trp Asn 
110 

Asp Ala Arg Trp Leu Glu Gly Asn Ile
125 

Asn Ser Val Val Leu Tyr Asp Pro Val 
140 

Glu Val Pro Cys Thr Asp Arg Cys Thr 
155 

Ile Gly Asp Ala Trp Lys Glu Leu Thr
170

Ser Asn Gln Leu Leu Val Trp Tyr Pro
185 

Asn Lys Pro Val Ala Pro Asp Arg Arg 
200 

Ile Ile Phe Ser Met Ser Tyr Leu Glu
215 

Thr Ala Ser Glu Asp Arg Ser Val Arg 
230 

Leu Arg Val Pro Gly Gly Arg Val Gln
245

Gly His Ser Ala Arg Val Trp Gln Val
260

Leu Ile Ser Ala Gly Glu Asp Cys Val
275

Glu Gly Glu Ile Leu Gln Ala Phe Arg
290

Ile Arg Ala Ile Ala Ala His Glu Arg
305

Gly Gly Asp Asp Ser Gly Ile Arg Leu
320 

Gly Tyr Arg Gly Leu Gly Val Ser Ala 
335 

Ser Arg Pro Gly Thr Leu Lys Ala Val 
350 

Arg Leu Leu Ala Val Thr Asp Thr Gly 
365 

Val Glu Val Lys Cys Trp Glu Gln Leu
380

Gln Ser Tyr Cys Leu Leu Glu Ala Ala
395 

Gly Leu Cys Ala Met Ala Asn Gly Glu 
410 
Pro Arg Ala


Thr Ser Glu 


10 


15 


Glu Cys Val 


Gly Asp Arg 


25 


30 


Leu Val Tyr 


Ser Leu Asp 


40 


45 


Arg Val Gin 


Asn Leu Leu 


55 


60 


Val Arg Pro


Glu Pro Asn 


70 


75 


Ala Val Phe 


Gly Ser Lys 


85 


90 


r;l v Gin Glv 
yjxy oj.1i w i y 


His Phe Tm 

lllw t 11V J- 


1 no 

X v V 


105 


FJeL OCX £S 


Trn Tie Tit* 
iiy lie i ip 




120 


& 1 a T .on Ala 
aXci Lieu r\± a 


T.en Glv His 

iJCU vjijr iii-O 


130 


135 


vdx o J. _y ty& 


Tin T i Gin 
116 UC LI Ulll 


145 


150 


t ,oi i Car Cor 

J-JC? Li OCX OCX 


Ala Cvs Leu 
ilia v-jro ucui 




165 


Tip Val Ala 

11C VCI1 Al» 


Glv Ala Val 

v3 1 Jr Aid VO.1 


175 


180 


A 1 3 Th r* Ala 


T,eu Ala Asra 


190 


195 


Tl o Cor vr 
lie ocx ui_y 


His Val Glv 

nio veil wij 


205 


210 


Cpr T.VS Glv 


Leu Leu Ala 


220 


225 


1JLC lip iJj o 


Val Glv Asn 

VCll VJljr nop 


235 


240 


Asn Tie Glv 


His C vs Phe 


250 


255 


Lys Leu Leu


Glu Asn Tyr 


265 


270 


Cys Leu Val 


Trp Ser His 


280 


2 85 


Gly His Gln


Gly Arg Gly 


295 


300 


Gin Ala Trp 


Val Ile Thr


310 


315 


Tm His Leu 


Val Gly Arg 


325 


330 


Leu Cys Phe 


Lys Ser Arg 


340 


345 


Thr Leu Ala 


Gly Ser Trp 


355 


360 


Ala Leu Tyr 


Leu Tyr Asp 


370 


375 


Leu Glu Asp 


Lys His Phe 


385 


390 


Pro Gly Pro 


Glu Gly Phe 


400 


405 


Gly Arg Val 


Lys Val Val 


415 


420 
Pro


Ile


Asn 


Thr 


Pro 
425 


Thr 


Ala 


Ala 


Val 


Asp 
430 


Gln


Gly 


Lys 


Val


His 


Ser 
440 


Leu 


Ser 


Trp 


Ala 


Leu 
445 


Arg 


Leu 


Leu 


Leu 


Leu 


Ala 
455 


Ser 


Gly


Pro


Gly


Gly
460 


Val 


Glu 


Ile


Ser 


Ala 


Ala 
470 


Pro 


Ser 


Gly 


Lys 


Ala 
475 


Ile


Arg 


Cys 


Arg 


Tyr 


Leu 
485 


Leu 


Pro 


Pro 


Ser 


Lys 
490 


Gln


Cys 


Ser 


Ala 


Phe 


Leu 


Pro 


Pro 


Gly 


Asp 


Phe 


Leu 








500 










505 




Arg 


Arg 


Gly 


Ser 


Val 
515 


Leu 


Leu 


Phe 


Pro 


Ser 
520 


Arg 


Lys 


Asp 


Pro 


Gly 


Val 
530 


Gly 


Gly 


Lys 


Ala 


Arg 
535 


Ala 


Pro 


Val 


Val 


Gly 


Ser 
545 


Gly 


Ser 


Ser 


Gly 


Gly 
550 


Gly 


Gly 


Leu 


Gly 


Pro 


Val 

J o u 


Ser 


Thr 


Leu 


Pro 


Ser 
565 


Leu 


Gly 


Val 


Thr 


Ser 


val 

575 


Thr 


Cys 


His 


Gly 


Gly 
580 


Tyr 


Gly 


Arg 


Asp 


Gly 


Ala 

590 


Tyr 


Tyr 


Gin 


Leu 


Phe 
595 


Val 


Leu 


Gin 


Pro 


Val 


Leu 

G05 


Arg 


Gin 


Lys 


Ser 


Cys 
610 


Arg 


Leu 


Ala 




Leu 


A tret 

620 


lie 


Val 


Pro 


Asp 


Glv 
625 


Ser 


Gly 


Phe 


His 


Ala 


Asn 
635 


Glu 


Phe 


Val 


Val 


Trp 
640 


Asn 


Glu 


Lys 


Leu 


His 


He 

650 

O J u 


Val 


Asn 


Cys 


Gly 


Gly 
655 


Gly 


Ala 


Phe 


Ser 


Asp 


Thr 


Glu 


Ala 


Ala 


Met 


Ala 


Phe 








£K5 

OO J 










670 

V? / V/ 




Asp 


Gly 


Asp 


Val 


Met 
6R0 

o O \J 


Leu 


Tyr 


Arg 


Ala 


Leu 
685 


Gly 


Pro 


His 


Val 


He 


Leu 

695 


Arg 


Glu 


Gly 


Leu 


His 
700 


Gly 


Cys 


Val 


Lys 


Arg 


Val 
71 0 

/XV/ 


Gly 


Thr 


He 


Thr 


Leu 
715 


Gly 


Val 


Pro 


Ser 


Phe 


Met 
72 5 


Gin 


Pro 


Asp 


Asp 


Leu 
730 


Glu 


Gly 


Pro 


Asp 


Leu 


Thr 
740 


Asp 


He 


Val 


He 


Thr 

745 


Cys 


Thr 


Val 


Cys 


Val 


Leu 
755 


Ala 


Leu 


Pro 


Thr 


Thr 
760 


Thr 


Ala 


Leu 


Thr 


Ala 


Val 
770 


Cys 


Asn 


His 


He 


Ser 
775 


Ser 


Ala 


Val 


Trp 


Gly 


He 
785 


Gly 


Thr 


Pro 


Gly 


Gly 
790 


Pro 


Pro 


Gly 


Leu 


Thr 


Ala 
800 


His 


Val 


Val 


Ser 


Ala 
805 


Gly 


Met 


His 


Cys 


Phe 


Ser 
815 


He 


Met 


Val 


Thr 


Pro 
820 


Asp 


Ser 


Arg 


Leu 


Ala 


Cys 

© ■** \J 


His 


Val 


Met 


His 


Leu 
835 


Ser 


Asp 


Glu 


Tyr 


Trp 


Asp 
845 


Arg 


Gin 


Arg 


Asn 


Arg 
850 


His 


Val 


Asp 


Pro 


Glu 


Thr 
860 


Arg 


Tyr 


Met 


Ser 


Leu 
865 


Ala 


Asp 


Gin 


Pro 


Gly 


Leu 
875 


Gly 


Pro 


Leu 


Val 


Ala 
880 


Ala 


Gly 


Ala 


Val 


Ser 


Ser 


Phe 


Phe 


Cys 


Arg 


He 


Leu 
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Thr 


Leu 


Phe 


Pro 
435 
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Tyr 


Glu 


Glu 
450 


Val 


Ala 


Cys 


Leu 
465 


Phe 


Val 


Lys 


Glu 
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Arg 


Trp 


His 


Thr 
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Val 


Cys 


Gly 
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Leu 
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Gly 
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Lys 
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Met 
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Trp 
615 


Met 


Val 


Ile
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Arg 
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Trp 
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Lys 
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Arg 
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He 
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Gly 
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Ser 
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Ser 
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Asp Val Ser Ser 


Arg 
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Ser Thr Pro Leu 
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Met 
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25 




30 
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He 
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Val Gly Lys Thr 
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Thr Ser Ala Phe 


Val 




50 
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75 
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He 


Trp 
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<211> 547 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2861155CD1 

<400> 56 

Met Lys Thr Leu Glu Thr Gln Pro Leu

1 5
Ser Asp Gln Asp Pro Ala Pro Ala His
20 

Pro Met Asn Lys Asn Ala Asp Ser Glu 
35 

Glu Arg Gly Asp Pro Pro Arg Leu Ser 
50 

Ser Ala Val Ser Gln Glu Leu Arg Glu
65 

Ser Thr Pro Leu Glu Thr Glu Phe Gly 
80 

Pro Arg Ile Glu Glu Gln Glu Leu Ser
95 

Ala Glu Glu Ala Asn Gly Ser Leu Ser 
110 

Pro Glu Leu Gly Ser Gly Lys Ala Met 
125 

Pro Ala Ala Glu Asp Glu Gly Asp Thr 
140 

Ser Gln Leu Pro Arg Phe Leu Ser Gly
155

Thr Gln Pro Glu Asn Phe Leu Lys Gly
170

Gly Ser Cys Ile Leu Thr Asn Ser Ala
185 

Tyr Asn Leu Pro Pro Glu Leu Tyr His 
200 

Tyr Ala Glu Met Val Pro Val Leu Arg 
215 

Ile Tyr Asp Tyr Cys Trp Tyr Ser Leu
230

Asp Thr Ser Tyr Val Ala Ser Ser Ser
245

Ile Trp Asp Ala Phe Thr Gly Glu Leu
260 

Tyr Asn His Leu Asp Glu Leu Thr Ala 
275 

Ser Pro Asp Gly Ser Gln Leu Phe Cys
290 

Arg Val Phe Ser Thr Ala Arg Pro Gly 
305 

Ala Thr Phe Ala Lys Lys Gln Gly Gln
320

Ile Ala Phe Ser Pro Ala Gln Pro Leu
335 

Gly Arg Ser Leu Gly Leu Tyr Ala Trp 
350 

Ala Leu Leu Gly Gly His Gln Gly Gly
365 

His Pro Asp Gly Asn Arg Phe Phe Ser 
380 

Glu Leu Leu Cys Trp Asp Leu Arg Gln
395 
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Ser Leu Gly Arg Glu Val Thr Thr Asn Gin Arg He Tyr Phe Asp 

410 415 420 

Leu Asp Pro Thr Gly Gln Phe Leu Val Ser Gly Ser Thr Ser Gly

425 430 435 

Ala Val Ser Val Trp Asp Thr Asp Gly Pro Gly Asn Asp Gly Lys 

440 445 450 

Pro Glu Pro Val Leu Ser Phe Leu Pro Gln Lys Asp Cys Thr Asn

455 460 465 

Gly Val Ser Leu His Pro Ser Leu Pro Leu Leu Ala Thr Ala Ser 

470 475 480

Gly Gln Arg Val Phe Pro Glu Pro Thr Glu Ser Gly Asp Glu Gly

485 490 495 

Glu Glu Leu Gly Leu Pro Leu Leu Ser Thr Arg His Val His Leu 

500 505 510 

Glu Cys Arg Leu Gln Leu Trp Trp Cys Gly Gly Gly Pro Asp Ser

515 520 525
Ser Ile Pro Asp Asp His Gln Gly Glu Lys Gly Gln Gly Gly Thr

530 535 540 
Gly Gly Arg Ser Trp Gly Ala 

545 

<210> 57 

<211> 686 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3002667CD1 



<400> 57 
Met Gly Glu 
1 

Pro Ser Gly 
Leu Ala Val 
Ser Ala Asn 
Arg Ala Thr 
Ser Ala Gly 
Leu Asn Ile
Ser Met Ala
Glu Asp Gly
Gln Phe Glu
Leu Ser Trp
Asp Tyr Ile
Lys Met Ile
Cys Ile Val
Ser Val Asp
Gly Thr Leu
Ser Ile Ala



Phe 


Lys Val 
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He 


Arg Cys 




20 


Ser 


Arg Thr 




35 


Tyr 


Phe Gln




50 


Glu 


Ala Leu 




65 


Leu 


Asn Gly 




80 


Lys 


Tyr Ala 




95 


Ala 


Ser Pro 




110 


Ser 


Val Lys 




125 


Arg 


Asn Phe 




140 


His 


Pro Ser 




155 


Ser 


Val Phe 




170 


Val 


Asp Arg 




185 


Trp 


Gly Val 




200 


Ser 


Ala Gly 




215 


Val 


Lys Ser 




230 


Val 


Ala Asp 



His Arg Val 
Val Ala Tyr 
Asp Gly Thr 
Glu Lys Phe 
Cys Trp Ala 
Glu Ile Met
Met Asp Ala
Ser Gly Ser
Leu Phe Gln
Asp Arg Gln
Gly Thr His
Asp Val Lys
Gln Tyr Met
Ala Phe Leu
Lys Val Gln
His Leu Ile
Gln Glu Asp



Arg Phe Phe 
10 

Asn Asn Gln
25

Val Glu Ile
40 

Phe Pro Gly 
55 

Glu Gly Gln
70 

Glu Tyr Asp 
85 

Phe Gly Gly 
100 

Gln Leu Leu
115

Ile Thr Pro
130 

Lys Ser Arg 
145 

Ile Ala Ala
160 

Ser Gly Ser 
175 

Gly Val Ser 
190 

Ser Asp Gly 
205 

Phe Trp Asp 
220 

Ala Asn Ala 
235 

Ser Phe Val 



Asn Tyr Val 
15 

Ser Asn Arg 
30 

Tyr Asn Leu 
45 

His Glu Ser 
60 

Arg Leu Phe 
75 

Leu Gln Ala
90

Pro Ile Trp
105 

Val Gly Cys 
120 

Asp Lys Ile
135

Ile Leu Ser
150

Gly Ser Ile
165 

Ala Val His 
180 

Lys Arg Lys 
195 

Thr Ile Ile

210 

Ser Ala Thr 
225 

Asp Val Gln
240 

Val Gly Thr 
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245 

Ala Glu Gly Thr Val Phe His Phe Gln
260

Asn Ser Ser Glu Lys Gln Trp Val Arg
275 

His Thr His Asp Val Arg Thr Val Ala 
290 

Ile Ser Gly Gly Thr Asp Thr His Leu
305 

Glu Lys Val Glu Val Lys Asn Tyr Asp 
320 

Thr Phe Pro His Arg Cys Leu Ile Ser
335

Leu Leu Leu Phe Gln Phe Ala His His
350 

Gly Ser Thr Val Ala Thr Gly Lys Asn 
365 

Ser Lys Asn Ala Asp His Leu Leu His 
380 

Glu Asn Ile Ile Cys Ser Cys Ile Ser
395 

Ala Tyr Ser Thr Val Ser Arg Phe Phe 
410 

Glu His Asp Asn Ile Ser Leu Lys Arg
425

Phe Leu Arg Ser Ala Leu Gln Ile Leu
440

Lys Leu Phe Val Ala Ser Asn Gln Gly
455 

Leu Ser Gly Gly Ser Phe Lys His Leu 
470 

Ser Gly Thr Val Glu Ala Met Cys Leu 
485 

Gly Asn Trp Leu Ala Ala Ser Gly Thr 
500 

Tyr Asn Val Lys Gln Leu Lys Leu His
515

Asn Phe Pro Val Thr Ala Met Ala Ile
530

Leu Val Ile Ala His Ser Asp Gln Gln
545

Pro Asp Lys Gln Tyr Thr Asp Trp Ser
560

Gly Phe His His Leu Trp Leu Gln Arg
575

Ile Ser Phe His Pro Lys Arg Pro Met
590 

Ala Tyr Met Phe Cys Ile Ile Asp Lys
605 

Asp Lys Thr Leu Leu Tyr Asn Pro Phe 
620 

Asp Val Ile Arg Arg Arg Thr Ala His
635

Ile Tyr Lys Pro Leu Leu Phe Met Asp
650 

Leu Val Ala Val Glu Arg Pro Leu Asp 
665 

Pro Pro Pro Ile Lys Lys Lys Lys Phe
680 
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<221> misc_feature 

<223> Incyte ID No: 3043734CD1 

<400> 58 
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<223> Incyte ID No: 3294893CD1 

<400> 59 
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20 


Lys 


Leu 


Pro Gly 


Gly 


Ala 


Gln


His 


Pro 
35 


Pro 


Phe 


Asp Pro 


Leu 


Arg 


Ser 


Thr 


Ala 


Lys 


Met 


Pro Thr 










50 








Arg 


Val 


Ser 


Thr 


Phe 


Gln


Glu 


Phe Glu 










65 








Trp 


Asp 


Ala 


Gly 


Glu 


Asp 


Asp 


Asp Glu 










80 








Glu 


Ser 


Leu 


Asn 


Ser 


Glu 


Val 


Val Met 










95 








Leu 


Arg 


Asn 


His 


Ser 


Gln


Arg


Gln Gly










110 








Gly 


Pro 


Gly 


Leu 


Gln


Gln


Lys 


Pro Arg 










125 








Ser 


Pro 


Pro 


Ser 


Gly 


Asp 


Leu 


Arg Leu 










140 








Ser 


His 


Thr 


Ser 


Cys 


Pro 


Ala 


Glu Ser 










155 








Leu 


Gln


Arg 


Ser 


Gln


Ser 


Leu 


Pro His 










170 








Gly 


Gly 


Thr 


Ser 


Asp 


Pro 


Ser 


Thr Leu 










185 








Glu 


Arg 


Glu 


Ala 


Ser 


Arg 


Leu 


Asp Lys 










200 








Gly 


Pro 


Asn 


Thr 


Asp 


Leu 


Glu 


Glu Leu 










215 








Gly 


Ile


Pro 


Lys 


Pro 


Val 


Arg 


Pro Met 










230 








Gly 


Tyr 


Leu 


Pro 


Ala 


Asn 


Val 


Asp Arg 



245 



PCT/US00/19698 



l nr 


Gln Leu Arg Arg


Ser 


10 




15 


Ala 


Arg Ala Trp Asp 


Leu 


25 




30 


Leu 


Ala Glu Ile Glu


Ala 


40 




45 


Ala 


Gly Phe Pro Gln


Tyr 


55 




60 


Pro 


Ile Asn Ile Val


Ala 


70 




75 


Lys 


Asp Leu Val Glu 


Pro 


85 




90 



Gly 


Gln


Glu 


Ala 


Ile


Leu 


10 










15 


Ser 


Ile


Gln


His 


Val 


Tyr 


25 










30 


Leu 


Leu 


His 


Gly Thr 


Leu 


40 










45 


Thr 


Pro 


Val 


Lys 


Ala 


Lys 


55 










60 


Ser 


Asn 


Thr 


Ser 


Asp 


Ala 


70 










75 


Leu 


Leu 


Ala 


Met 


Ala 


Ala 


85 










90 


Glu 


Thr Ala Asn Arg 


Val 


100 










105 


Arg 


Pro 


Thr 


Leu 


Gln


Glu 


115 










120 


Pro 


Glu 


Ala 


Glu 


Pro 


Pro 


130 










135 


Val 


Lys 


Ser 


Val 


Ser 


Glu 


145 










150 


Ala 


Ser Asp Ala Ala 


Pro 


160 










165 


Ser 


Ala 


Thr 


Val 


Thr 


Leu 


175 










180 


Ser 


Ser 


Ser 


Ala 


Leu 


Ser 


190 










195 


Phe 


Lys 


Gln


Leu 


Leu 


Ala 


205 










210 


Arg 


Arg 


Leu 


Ser 


Trp 


Ser 


220 










225 


Thr 


Trp 


Lys 


Leu 


Leu 


Ser 


235 










240 


Arg 


Pro 


Ala 


Thr 


Leu 


Gln


250 










255 
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Arg 


Lys 


Gln


Lys 


Glu 


Tyr 


Phe 


Ala Phe 


Ile


Glu 


His 


Tyr 


Tyr 


Asp 










260 








265 










270 


Ser 


Arg 


Asn 


Asp 


Glu 


Val 


His 


Gln Asp


Thr 


Tyr 


Arg 


Gln


Ile


His 










275 








280 










285 


Ile


Asp 


Ile


Pro 


Arg 


Met 


Ser 


Pro Glu 


Ala 


Leu 


Ile


Leu 


Gln


Pro 










290 








295 










300 


Lys 


Val 


Thr 


Glu 


Ile


Phe 


Glu 


Arg Ile


Leu 


Phe 


Ile


Trp 


Ala 


Ile










305 








310 










315 


Arg 


His 


Pro 


Ala 


Ser 


Gly 


Tyr 


Val Gln


Gly 


Ile


Asn 


Asp 


Leu 


Val 










320 








325 










330 


Thr 


Pro 


Phe 


Phe 


Val 


Val 


Phe 


Ile Cys


Glu 


Tyr 


Ile


Glu 


Ala 


Glu 










335 








340 










345 


Glu 


Val 


Asp 


Thr 


Val 


Asp 


Val 


Ser Gly 


Val 


Pro 


Ala 


Glu 


Val 


Leu 










350 








355 










360 


Cys 


Asn 


Ile


Glu 


Ala 


Asp 


Thr 


Tyr Trp 


Cys 


Met 


Ser Lys 


Leu 


Leu 










365 








370 










375 


Asp 


Gly 


Ile


Gln


Asp 


Asn 


Tyr 


Thr Phe 


Ala 


Gln


Pro Gly 


Ile


Gln










380 








385 










390 


Met 


Lys 


Val 


Lys 


Met 


Leu 


Glu 


Glu Leu 


Val 


Ser 


Arg 


Ile


Asp 


Glu 










395 








400 










405 


Gln


Val 


His 


Arg 


His 


Leu 


Asp 


Gln His


Glu 


Val 


Arg 


Tyr 


Leu 


Gln










410 








415 










420 


Phe 


Ala 


Phe 


Arg 


Trp 


Met 


Asn 


Asn Leu 


Leu 


Met 


Arg Glu 


Val 


Pro 










425 








430 










435 


Leu 


Arg 


Cys 


Thr 


Ile


Arg 


Leu 


Trp Asp 


Thr 


Tyr


Gln


Ser 


Glu 


Pro 










440 








445 










450 


Asp 


Gly 


Phe 


Ser 


His 


Phe 


His 


Leu Tyr 


Val 


Cys 


Ala 


Ala 


Phe 


Leu 










455 








460 










465 


Val 


Arg 


Trp 


Arg 


Lys 


Glu 


Ile


Leu Glu 


Glu 


Lys 


Asp 


Phe 


Gln


Glu 










470 








475 










480 


Leu 


Leu 


Leu 


Phe 


Leu 


Gln


Asn 


Leu Pro 


Thr 


Ala 


His 


Trp 


Asp 


Asp 










485 








490 










495 


Glu 


Asp 


Ile


Ser 


Leu 


Leu 


Leu 


Ala Glu 


Ala 


Tyr 


Arg 


Leu 


Lys 


Phe 










500 








505 










510 


Ala 


Phe 


Ala 


Asp 


Ala 


Pro 


Asn 


His Tyr 


Lys 


Lys 











515 520 

<210> 60 
<211> 751 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3349052CD1 

<400> 60 



Met 


Arg 


Leu 


Leu 


Gly 


Ala 


Ala 


Ala Val 


Ala 


Ala 


Leu Gly 


Arg 


Gly 


1 








5 








10 








15 


Arg 


Ala 


Pro 


Ala 


Ser 


Leu 


Gly 


Trp Gln


Arg 


Lys 


Gln Val


Asn 


Trp 










20 








25 








30 


Lys 


Ala 


Cys 


Arg 


Trp 


Ser 


Ser 


Ser Gly 


Val 


Ile


Pro Asn 


Glu 


Lys 










35 








40 








45 


Ile


Arg 


Asn 


Ile


Gly 


Ile


Ser 


Ala His 


Ile


Asp 


Ser Gly 


Lys 


Thr 










50 








55 








60 


Thr 


Leu 


Thr 


Glu 


Arg 


Val 


Leu 


Tyr Tyr 


Thr 


Gly 


Arg Ile


Ala 


Lys 










65 








70 








75 


Met 


His 


Glu 


Val 


Lys 


Gly 


Lys 


Asp Gly 


Val 


Gly 


Ala Val 


Met 


Asp 










80 








85 








90 


Ser 


Met 


Glu 


Leu 


Glu 


Arg 


Gln


Arg Gly 


Ile


Thr 


Ile Gln


Ser 


Ala 










95 








100 








105 


Ala 


Thr 


Tyr 


Thr 


Met 


Trp 


Lys 


Asp Val 


Asn 


Ile


Asn Ile


Ile


Asp 










110 








115 








120 


Thr 


Pro 


Gly 


His 


Val 


Asp 


Phe 


Thr Ile


Glu 


Val 


Glu Arg 


Ala 


Leu 
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125 130 135 

Arg Val Leu Asp Gly Ala Val Leu Val Leu Cys Ala Val Gly Gly 
140 145 150 

Val Gln Cys Gln Thr Met Thr Val Asn Arg Gln Met Lys Arg Tyr
155 160 165

Asn Val Pro Phe Leu Thr Phe Ile Asn Lys Leu Asp Arg Met Gly
170 175 180

Ser Asn Pro Ala Arg Ala Leu Gln Gln Met Arg Ser Lys Leu Asn
185 190 195

His Asn Ala Ala Phe Met Gln Ile Pro Met Gly Leu Glu Gly Asn
200 205 210

Phe Lys Gly Ile Ile Asp Leu Ile Glu Glu Arg Ala Ile Tyr Phe
215 220 225

Asp Gly Asp Phe Gly Gln Ile Val Arg Tyr Gly Glu Ile Pro Ala
230 235 240

Glu Leu Arg Ala Ala Ala Thr Asp His Arg Gln Glu Leu Ile Glu

245 250 255 

Cys Val Ala Asn Ser Asp Glu Gln Leu Gly Glu Met Phe Leu Glu

260 265 270

Glu Lys Ile Pro Ser Ile Ser Asp Leu Lys Leu Ala Ile Arg Arg

275 280 285 

Ala Thr Leu Lys Arg Ser Phe Thr Pro Val Phe Leu Gly Ser Ala 

290 295 300 

Leu Lys Asn Lys Gly Val Gln Pro Leu Leu Asp Ala Val Leu Glu

305 310 315

Tyr Leu Pro Asn Pro Ser Glu Val Gln Asn Tyr Ala Ile Leu Asn

320 325 330

Lys Glu Asp Asp Ser Lys Glu Lys Thr Lys Ile Leu Met Asn Ser

335 340 345 

Ser Arg Asp Asn Ser His Pro Phe Val Gly Leu Ala Phe Lys Leu 

350 355 360 

Glu Val Gly Arg Phe Gly Gln Leu Thr Tyr Val Arg Ser Tyr Gln

365 370 375

Gly Glu Leu Lys Lys Gly Asp Thr Ile Tyr Asn Thr Arg Thr Arg

380 385 390

Lys Lys Val Arg Leu Gln Arg Leu Ala Arg Met His Ala Asp Met

395 400 405

Met Glu Asp Val Glu Glu Val Tyr Ala Gly Asp Ile Cys Ala Leu

410 415 420

Phe Gly Ile Asp Cys Ala Ser Gly Asp Thr Phe Thr Asp Lys Ala

425 430 435

Asn Ser Gly Leu Ser Met Glu Ser Ile His Val Pro Asp Pro Val

440 445 450 

Ile Ser Ile Ala Met Lys Pro Ser Asn Lys Asn Asp Leu Glu Lys

455 460 465

Phe Ser Lys Gly Ile Gly Arg Phe Thr Arg Glu Asp Pro Thr Phe

470 475 480

Lys Val Tyr Phe Asp Thr Glu Asn Lys Glu Thr Val Ile Ser Gly

485 490 495

Met Gly Glu Leu His Leu Glu Ile Tyr Ala Gln Arg Leu Glu Arg

500 505 510

Glu Tyr Gly Cys Pro Cys Ile Thr Gly Lys Pro Lys Val Ala Phe

515 520 525

Arg Glu Thr Ile Thr Ala Pro Val Pro Phe Asp Phe Thr His Lys

530 535 540

Lys Gln Ser Gly Gly Ala Gly Gln Tyr Gly Lys Val Ile Gly Val

545 550 555 

Leu Glu Pro Leu Asp Pro Glu Asp Tyr Thr Lys Leu Glu Phe Ser 

560 565 570 

Asp Glu Thr Phe Gly Ser Asn Ile Pro Lys Gln Phe Val Pro Ala

575 580 585 

Val Glu Lys Gly Phe Leu Asp Ala Cys Glu Lys Gly Pro Leu Ser 

590 595 600 






WO 01/05970 






Gly His 


Lys 


Leu 


Ser 


Gly 


Leu 


Arg Phe Val Leu Gln Asp


Gly Ala 




605 






610 


615 


His His 


Met 


Val 


Asp 


Ser 


Asn 


Glu Ile Ser Phe Ile Arg


Ala Gly 








620 






625 


630 


Glu Gly 


Ala 


Leu 


Lys 


Gln


Ala 


Leu Ala Asn Ala Thr Leu 


Cys Ile






635 






640 


645 


Leu Glu 


Pro 


Ile


Met 


Ala 


Val 


Glu Val Val Ala Pro Asn 


Glu Phe 








650 






655 


660 


Gln Gly


Gln


Val 


Ile


Ala 


Gly 


Ile Asn Arg Arg His Gly


Val Ile






665 




670 


675 


Thr Gly 


Gln Asp


Gly 


Val 


Glu 


Asp Tyr Phe Thr Leu Tyr 


Ala Asp 






680 






685 


690 


Val Pro 


Leu 


Asn 


Asp 


Met 


Phe 


Gly Tyr Ser Thr Glu Leu 


Arg Ser 








695 






700 


705 


Cys Thr 


Glu Gly 


Lys 


Gly 


Glu 


Tyr Thr Met Glu Tyr Ser 


Arg Tyr 






710 






715


720


Gln Pro


Cys 


Leu 


Pro 


Ser 


Thr 


Gln Glu Asp Val Ile Asn


Lys Tyr 






725 






730 


735 


Leu Glu 


Ala 


Thr 


Gly 


Gln


Leu 


Pro Val Lys Lys Gly Lys 


Ala Lys 








740 






745 


750 


Asn 
















<210> 61 














<211> 666 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID No: 3357264CD1




<220> 
















<221> unsure 












<222> 281 














<223> unknown or other 








<400> 61 














Met Cys 


Gly 


Ala 


Val 


Ile


Pro 


Leu His Lys Pro Ala Gly Arg Lys 


1 






5 






10 


15 


Leu Gln


Asn 


Gln


Arg 


Ala 


Ala 


Leu Asn Gln Gln Ile Leu


Lys Ala 








20 






25 


30 


Val Arg 


Met 


Arg 


Thr 


Gly 


Ala 


Glu Asn Leu Leu Lys Val Ala Thr 






35 






40 


45 


Asn Ser 


Lys 


Val 


Arg 


Glu 


Gln


Val Arg Leu Glu Leu Ser 


Phe Val 








50 






55 


60 


Asn Ser 


Asp 


Leu 


Gln


Met 


Leu 


Lys Glu Glu Leu Glu Gly Leu Asn 








65 






70 


75 


Ile Ser


Val 


Gly 


Val 


Tyr 


Gln


Asn Thr Glu Glu Ala Phe 


Thr Ile






80 






85 


90 


Pro Leu 


Ile


Pro 


Leu 


Gly 


Leu 


Lys Glu Thr Lys Asp Val Asp Phe 








95 






100 


105 


Ala Val 


Val 


Leu 


Lys 


Asp 


Phe 


Ile Leu Glu His Tyr Ser Glu Asp








110 






115 


120 


Gly Tyr 


Leu 


Tyr 


Glu 


Asp 


Glu 


Ile Ala Asp Leu Met Asp Leu Arg






125 






130 


135 


Gln Ala


Cys 


Arg 


Thr 


Pro 


Ser 


Arg Asp Glu Ala Gly Val Glu Leu 








140 






145 


150 


Leu Met 


Thr 


Tyr 


Phe 


Ile


Gln


Leu Gly Phe Val Glu Ser 


Arg Phe 








155 






160 


165 


Phe Pro 


Pro 


Thr 


Arg 


Gln


Met 


Gly Leu Leu Phe Thr Trp 


Tyr Asp 








170 






175 


180 


Ser Leu 


Thr 


Gly 


Val 


Pro 


Val 


Ser Gln Gln Asn Leu Leu


Leu Glu 








185 






190 


195 
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Lys Ala Ser Val Leu Phe Asn Thr Gly Ala Leu Tyr Thr Gin He 
200 205 210 

Gly Thr Arg Cys Asp Arg Gln Thr Gln Ala Gly Leu Glu Ser Ala
215 220 225

Ile Asp Ala Phe Gln Arg Ala Ala Gly Val Leu Asn Tyr Leu Lys
230 235 240 

Asp Thr Phe Thr His Thr Pro Ser Tyr Asp Met Ser Pro Ala Met 
245 250 255 

Leu Ser Val Leu Val Lys Met Met Leu Ala Gln Ala Gln Glu Ser
260 265 270

Val Phe Glu Lys Ile Ser Leu Pro Gly Ile Xaa Asn Glu Phe Phe
275 280 285

Met Leu Val Lys Val Ala Gln Glu Ala Ala Lys Val Gly Glu Val
290 295 300

Tyr Gln Gln Leu His Ala Ala Met Ser Gln Ala Pro Val Lys Glu
305 310 315

Asn Ile Pro Tyr Ser Trp Ala Ser Leu Ala Cys Val Lys Ala His
320 325 330

His Tyr Ala Ala Leu Ala His Tyr Phe Thr Ala Ile Leu Leu Ile
335 340 345

Asp His Gln Val Lys Pro Gly Thr Asp Leu Asp His Gln Glu Lys
350 355 360

Cys Leu Ser Gln Leu Tyr Asp His Met Pro Glu Gly Leu Thr Pro
365 370 375

Leu Ala Thr Leu Lys Asn Asp Gln Gln Arg Arg Gln Leu Gly Lys
380 385 390 

Ser His Leu Arg Arg Ala Met Ala His His Glu Glu Ser Val Arg 
395 400 405 

Glu Ala Ser Leu Cys Lys Lys Leu Arg Thr Ile Glu Val Leu Gln
410 415 420

Lys Val Leu Cys Ala Ala Gln Glu Arg Ser Arg Leu Thr Tyr Ala
425 430 435

Gln His Gln Glu Glu Asp Asp Leu Leu Asn Leu Ile Asp Ala Pro
440 445 450

Ser Val Val Ala Lys Thr Glu Gln Glu Val Asp Ile Ile Leu Pro
455 460 465

Gln Phe Ser Lys Leu Thr Val Thr Asp Phe Phe Gln Lys Leu Gly
470 475 480 

Pro Leu Ser Val Phe Ser Ala Asn Lys Arg Trp Thr Pro Pro Arg 
485 490 495 

Ser Ile Arg Phe Thr Ala Glu Glu Gly Asp Leu Gly Phe Thr Leu
500 505 510

Arg Gly Asn Ala Pro Val Gln Val His Phe Leu Asp Pro Tyr Cys
515 520 525

Ser Ala Ser Val Ala Gly Ala Arg Glu Gly Asp Tyr Ile Val Ser
530 535 540

Ile Gln Leu Val Asp Cys Lys Trp Leu Thr Leu Ser Glu Val Met
545 550 555

Lys Leu Leu Lys Ser Phe Gly Glu Asp Glu Ile Glu Met Lys Val
560 565 570 

Val Ser Leu Leu Asp Ser Thr Ser Ser Met His Asn Lys Ser Ala 
575 580 585 

Thr Tyr Ser Val Gly Met Gln Lys Thr Tyr Ser Met Ile Cys Leu
590 595 600

Ala Ile Asp Asp Asp Asp Lys Thr Asp Lys Thr Lys Lys Ile Ser
605 610 615

Lys Lys Leu Ser Phe Leu Ser Trp Gly Thr Asn Lys Asn Arg Gln
620 625 630

Lys Ser Ala Ser Thr Leu Cys Leu Pro Ser Val Gly Ala Ala Arg
635 640 645

Pro Gln Val Lys Lys Lys Leu Pro Ser Pro Phe Ser Leu Leu Asn
650 655 660 

Ser Asp Ser Ser Trp Tyr 
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<210> 62 
<211> 746 
<212> PRT 
<213> Homo sapiens

665



<220> 

<221> misc_feature

<223> Incyte ID No: 3576329CD1 



<400> 62 

Met Ala Gly Ser Arg Gly Ala Gly Arg Thr Ala Ala Pro Ser Val 
1 5 10 15

Arg Pro Glu Lys Arg Arg Ser Glu Pro Glu Leu Glu Pro Glu Pro 

20 25 30 

Glu Pro Glu Pro Pro Leu Leu Cys Thr Ser Pro Leu Ser His Ser 

35 40 45 

Thr Gly Ser Asp Ser Gly Val Ser Asp Ser Glu Glu Ser Val Phe 
50 55 60 

Ser Gly Leu Glu Asp Ser Gly Ser Asp Ser Ser Glu Asp Asp Asp 
65 70 75 

Glu Gly Asp Glu Glu Gly Glu Asp Gly Ala Leu Asp Asp Glu Gly 
80 85 90 

His Ser Gly Ile Lys Lys Thr Thr Glu Glu Gln Val Gln Ala Ser
95 100 105

Thr Pro Cys Pro Arg Thr Glu Met Ala Ser Ala Arg Ile Gly Asp

110 115 120

Glu Tyr Ala Glu Asp Ser Ser Asp Glu Glu Asp Ile Arg Asn Thr

125 130 135 

Val Gly Asn Val Pro Leu Glu Trp Tyr Asp Asp Phe Pro His Val 

140 145 150 

Gly Tyr Asp Leu Asp Gly Arg Arg Ile Tyr Lys Pro Leu Arg Thr

155 160 165

Arg Asp Glu Leu Asp Gln Phe Leu Asp Lys Met Asp Asp Pro Asp

170 175 180

Tyr Trp Arg Thr Val Gln Asp Pro Met Thr Gly Arg Asp Leu Arg

185 190 195

Leu Thr Asp Glu Gln Val Ala Leu Val Arg Arg Leu Gln Ser Gly

200 205 210

Gln Phe Gly Asp Val Gly Phe Asn Pro Tyr Glu Pro Ala Val Asp

215 220 225

Phe Phe Ser Gly Asp Val Met Ile His Pro Val Thr Asn Arg Pro

230 235 240

Ala Asp Lys Arg Ser Phe Ile Pro Ser Leu Val Glu Lys Glu Lys

245 250 255

Val Ser Arg Met Val His Ala Ile Lys Met Gly Trp Ile Gln Pro

260 265 270 

Arg Arg Pro Arg Asp Pro Thr Pro Ser Phe Tyr Asp Leu Trp Ala 

275 280 285 

Gln Glu Asp Pro Asn Ala Val Leu Gly Arg His Lys Met His Val

290 295 300 

Pro Ala Pro Lys Leu Ala Leu Pro Gly His Ala Glu Ser Tyr Asn 

305 310 315 

Pro Pro Pro Glu Tyr Leu Leu Ser Glu Glu Glu Arg Leu Ala Trp 

320 325 330 

Glu Gln Gln Glu Pro Gly Glu Arg Lys Leu Gly Phe Leu Pro Arg

335 340 345

Lys Phe Pro Ser Leu Arg Ala Val Pro Ala Tyr Gly Arg Phe Ile

350 355 360

Gln Glu Arg Phe Glu Arg Cys Leu Asp Leu Tyr Leu Cys Pro Arg

365 370 375

Gln Arg Lys Met Arg Val Asn Val Asp Pro Glu Asp Leu Ile Pro

380 385 390 
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Lys Leu Pro Arg Pro Arg Asp Leu Gin 
395 

Ala Leu Val Tyr Arg Gly His Ser Asp 
410 

Val Ser Pro Gly Gly Gln Trp Leu Val
425 

Ser Leu Arg Leu Trp Glu Val Ala Thr 
440 

Val Pro Val Gly Gly Val Val Lys Ser 
455 

Pro Ala Val Cys Leu Val Ala Ala Ala 
470 

Leu Leu Asn Pro Ala Leu Gly Asp Arg 
485 

Asp Gln Leu Leu Ser Ala Phe Val Pro
500

Gln Pro Ala Arg Trp Leu Glu Ala Ser
515

Gly Leu Arg Leu Arg Ile Cys His Gly
530 

Thr Trp His Gly Arg Gly Asp Tyr Leu 
545 

Gln Gly His Thr Gln Val Leu Ile His
560

Ser Gln Ser Pro Phe Arg Arg Ser His
575 

Ala Phe His Pro Ala Arg Pro Phe Leu 
590 

Ser Val Arg Leu Tyr His Leu Leu Arg 
605 

Leu Met Pro Asn Cys Lys Trp Val Ser 
620 

Ala Gly Asp Asn Val Ile Cys Gly Ser
635 

Trp Phe Asp Leu Asp Leu Ser Thr Lys 
650 

His His Lys Lys Ala Leu Arg Ala Val 
665 

Pro Leu Phe Ala Ser Gly Ser Asp Asp 
680 

His Gly Met Val Tyr Asn Asp Leu Leu 
695 

Pro Val Lys Val Leu Lys Gly His Val 
710 

Val Leu Asp Val Ile Phe His Pro Thr
725 

Ser Gly Ala Asp Gly Thr Val Arg Leu 
740 

<210> 63 

<211> 212 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3805550CD1 

<400> 63 

Met Ala Gly Pro Gly Pro Gly Pro Gly 

1 5 
Asp Phe Leu Phe Lys Leu Val Leu Val 
20 

Lys Thr Cys Val Val Gln Arg Phe Lys






Pro Phe Pro Thr 


Cys Gln


400 


405 


Leu Val Arg Cys 


Leu Ser 


415 


420 


Ser Gly Ser Asp Asp Gly 


430 


435 


Ala Arg Cys Val Arg Thr 


445 


450 


Val Ala Trp Asn 


Pro Ser 


460 


465 


Val Glu Asp Ser 


Val Leu 


475 


480 


Leu Val Ala Gly Ser Thr 


490 


495 


Pro Glu Glu Pro 


Pro Leu 


505 


510 


Glu Glu Glu Arg Gln Val


520 


525 


Lys Pro Val Thr 


Gln Val


535 


540 


Ala Val Val Leu 


Ala Thr 


550 


555 


Gln Leu Ser Arg Arg Arg


565 


570 


Gly Gln Val Gln


Arg Val 


580 


585 


Leu Val Ala Ser 


Gln Arg


595 


600 


Gln Glu Leu Thr


Lys Lys 


610 


615 


Ser Leu Ala Val 


His Pro 


625 


630 


Tyr Asp Ser Lys 


Leu Val 


640 


645 


Pro Tyr Arg Met 


Leu Arg 


655 


660 


Ala Phe His Pro 


Arg Tyr 


670 


675 


Gly Ser Val Ile


Val Cys 


685 


690 


Gln Asn Pro Leu


Leu Val 


700 


705 


Leu Thr Arg Asp 


Leu Gly 


715 


720 


Gln Pro Trp Val


Phe Ser 


730 


735 


Phe Thr 




745 





Asp Pro Asp Glu Gln Tyr
10 15 

Gly Asp Ala Ser Val Gly 
25 30 

Thr Gly Ala Phe Ser Glu 
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35 40 45 

Arg Gln Gly Ser Thr Ile Gly Val Asp Phe Thr Met Lys Thr Leu
50 55 60

Glu Ile Gln Gly Lys Arg Val Lys Leu Gln Ile Trp Asp Thr Ala
65 70 75

Gly Gln Glu Arg Phe Arg Thr Ile Thr Gln Ser Tyr Tyr Arg Ser
80 85 90

Ala Asn Gly Ala Ile Leu Ala Tyr Asp Ile Thr Lys Arg Ser Ser
95 100 105

Phe Leu Ser Val Pro His Trp Ile Glu Asp Val Arg Lys Tyr Ala
110 115 120

Gly Ser Asn Ile Val Gln Leu Leu Ile Gly Asn Lys Ser Asp Leu
125 130 135

Ser Glu Leu Arg Glu Val Ser Leu Ala Glu Ala Gln Ser Leu Ala
140 145 150

Glu His Tyr Asp Ile Leu Cys Ala Ile Glu Thr Ser Ala Lys Asp
155 160 165 

Ser Ser Asn Val Glu Glu Ala Phe Leu Arg Val Ala Thr Glu Leu 
170 175 180 

Ile Met Arg His Gly Gly Pro Leu Phe Ser Glu Lys Ser Pro Asp
185 190 195

His Ile Gln Leu Asn Ser Lys Asp Ile Gly Glu Gly Trp Gly Cys
200 205 210 

Gly Cys 

<210> 64 

<211> 307 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4546403CD1 



<400> 64 
Met Arg Cys 
1 

Cys Cys Asp 
Leu Gln Phe
Lys Ile Trp
Lys Tyr Lys
Cys Ala Phe
Asp Lys Ser
His Thr Leu
Ala Pro Asn
Val Asn Ile
Ser Thr Glu
Glu Asp Val
Val Gly Ile
Asn Leu Thr 



Leu 


His 


Ser 




5 




Phe 


Ser 


Ser 




20 




Phe 


Arg 


Leu 




35 




Ile


Val 


Ser 




50 




Ser 


Thr 


Leu 




65 




Ser 


His 


Asp 




80 




Val 


Ile


Val




95 




Thr 


Gln


His 




110 




Thr 


Leu 


Leu 




125 




Trp 


Gln


Phe 




140 




His 


Gln


Leu 




155 




Ser 


Thr 


Trp 




170 




Phe 


Lys 


Met 




185 




Lys 


Glu 


Ser 




200 





Glu Lys Ala 
Gln Pro Val
Ala Ser Cys
Phe Thr His
Ser Gly His
Gly Gln Met
Tyr Asp Thr
Thr Arg Tyr
Leu Ala Thr
Asp Leu Glu
Lys Gln Phe
Leu Cys Ala
Asn Asn Ile
Leu Ala Asp 



His Asp Leu 
10 

Ser Asp Gly 
25 

Gly Gln Asp
40

Ile Leu Gly
55 

Cys Ala Pro 
70 

Leu Val Ser 
85 

Asn Thr Glu 
100 

Val Thr Thr 
115 

Gly Ser Met 
130 

Thr Leu Cys 
145 

Thr Glu Asp 
160 

Gln Asp Leu
175 

Asp Gly Lys 
190 

Asp Leu Lys 
205 



Gly Ile Thr
15

Glu Gln Gly
30

Cys Gln Val
45 

Phe Glu Leu 
60 

Val Leu Ala 
75 

Gly Ser Val 
90 

Asn Ile Leu
105 

Cys Ala Phe 
120 

Asp Lys Thr 
135 

Gln Ala Arg
150 

Trp Ser Glu 
165 

Lys Asp Leu 
180 

Glu Leu Leu 
195 

Ile Glu Ser
210 
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Leu Gly 


Leu 




Cor T ,vc \/a 1 










9 1 R 




1 IIX Ajy 5 


Val


Lys 


OCI JJCU OCi 


Ser Gly 








o ^ n 




Cys Pro 


Ile


rpVi -w- 

lnx 


Arg oiu Jjcu 












Asp Gly 


Tyr 


Cor 


THry f^lii TiVC 
i y j. oiu uy & 


Glu Ala 






<£ V w 




Lys Lys 


Lys 


Arg 


TViy* Q^t* "Pi-rt 


Met Thr 








275 




Ala Val 


Leu 


Thr 


Pro Asn Arg 


Thr Leu 








290 




Trp Leu 


Glu 


Thr 


His Gin Lys 










305 




<210> 65 









<211> 378 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4767318CD1 

<400> 65 

Met Arg Ala Ala Ala Ala Pro Gly Leu 

1 5 
Leu Gln Cys Cys Glu Leu Glu Ala Gly
20 

Pro Ala Ala Ala Met Gly Pro Ser Ala 
35 

Gly Ser Met Ala Pro Trp Cys Ser Val 
50 

Tyr Val Leu Gly Met Gln Glu Leu Phe
65 

Arg Glu Phe Leu Ala His Ser Ala Lys 
80 

Ser Cys Asp Gly Arg Arg Leu Ala Ser 
95 

Ala Ser Val Phe Leu Leu Glu Lys Asp 
110 

Asn Tyr Arg Gly His Gly Asp Ser Val 
125 

Pro Ser Asn Pro Asp Leu Phe Val Thr 
140 

Ile Arg Ile Trp Asp Val Arg Thr Thr
155

Asn Thr Lys Gly Glu Asn Ile Asn Ile
170

Gln Thr Ile Ala Val Gly Asn Lys Asp
185

Asp Ala Lys Thr His Arg Ser Lys Ala
200

Glu Val Asn Glu Ile Ser Trp Asn Asn
215

Leu Thr Asn Gly Asn Gly Cys Ile Asn
230

Leu Lys Pro Val Gln Ser Ile Asn Ala
245

Cys Ile Lys Phe Asp Pro Met Gly Lys
260 

Ala Asp Ala Leu Val Ser Leu Trp Asp 
275 

Val Arg Cys Phe Ser Arg Leu Asp Trp 






Lys 


Tl ia 
J-lc 


UlU 


Glu 


Leu Arg 


220 








225 


Ile


Pro 


Asp


Glu 


Phe Ile


235








240 


Asp 




Val


Ile


Ala Ser 


250 








255 


Met 


Glu 


Asn 


Trp 


Ile Ser


265 








270 


Asn 


Leu 


Val 


Leu 


Pro Ser 


280 








285 


Lys 


Met 


Ala 


Ile


Asn Arg 


295 








300 



Thr 


Ala Pro Trp Arg 


Leu 


10 




15 


Glu 


Leu Gly Met Ala 


Val 


25 




30 


Leu Gly Gln Ser Gly Pro


40 




45 


Ser 


Ser Gly Pro Ser 


Arg 


55 




60 


Arg Gly His Ser Lys 


Thr 


70 




75 


Val 


His Ser Val Ala 


Trp 


85 




90 


Gly 


Ser Phe Asp Lys 


Thr 


100 




105 


Arg 


Leu Val Lys Glu 


Asn 


115 




120 


Asp Gln Leu Cys Trp His


130 




135 


Ala 


Ser Gly Asp Lys 


Thr 


145 




150 


Lys 


Cys Ile Ala Thr


Val 


160 




165 


Cys 


Trp Ser Pro Asp Gly 


175 




180 


Asp 


Val Val Thr Phe 


Ile


190 




195 


Glu 


Glu Gln Phe Lys


Phe 


205 




210 


Asp Asn Asn Met Phe 


Phe 


220 




225 


Ile


Leu Ser Tyr Pro 


Glu 


235 




240 


His 


Pro Ser Asn Cys 


Ile


250 




255 


Tyr 


Phe Ala Thr Gly Ser 


265 




270 


Val 


Asp Glu Leu Val 


Cys 


280 




285 


Pro 


Val Arg Thr Leu 


Ser 
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290 






295 




Phe 


Ser 


His 


Asp 


Gly 
305 


Lys 


Met Leu Ala 


Ser 
310 


Ala 


Phe 


Ile


Asp


Ile


Ala 

320 


Glu 


Val Glu Thr 


Gly 
325 


Asp 


Val 


Gln


Cys 


Glu 


Ser 
335 


Pro 


Thr Phe Thr 


Val 
340 


Ala 


Arg 


Pro 


Leu 


Leu 


Ala 
350 


Phe 


Ala Cys Asp 


Asp 
355 


Lys 


Asp 


Ser 


Ser 


Arg 


Glu 
365 


Ala 


Gly Thr Val 


Lys 
370 


Leu 



Asn Asp Ser 

<210> 66 

<211> 466 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4834527CD1 

<400> 66 



I1C L. 


Pro 




± ILL 




OCX 


Ala


Coy A^O 


Met 


Val 


1 








5 








10 




Leu


Ser 


Pro 

IrX \J 


Pro 

IrX \J 


Z X \J 

20 


Thr 


Glu 


Pro Thr 


Asp 
25 


Gly


Gln


XT X \J 


Leu 


Leu 


Asp 
35 


vsx y 


Ala 


Pro Ser 


Ser 
40 


Ala 


i-itr U. 


Tie 

X lc 


O Hi 


Hi c; 


50 


Val 


Pro 

IT X \J 


Thr Ala


Asp 
55 




Ala 


Tyr


Ile


Phe 


Thr 


Phe 


Leu 


Leu Ser 


Ser 


Arg 






Glu




65 








70 




IT X \J 


Am 

ax y 


Leu 


Leu 
80 


Ala 


Arg


Val Cys 


His 
85 


Leu 


Gln


Gln


Leu 


Asp 


Lys 
95 


Pro 


Val 


Leu Asp 


Lys 
100 


Ala 


Phe 


Gly 


Pro 


Lys 


Leu 
110 


Leu 


Gln


Leu Leu 


Ala 
115 


Glu 


Phe 


Pro 


Arg 


Asp 


Phe 
125 


Gln


Glu 


Glu Ser 


Thr 
130 


Ile


Asp 


Val 


Val 


Gly 


Arg 
140 


Ile


Ala 


Pro Cys 


Asp 
145 


Glu 


Arg 


Met 


His 


Gln


Leu 
155 


Leu 


Gln


Ala Leu 


His 
160 


Gln


Leu 


Arg 


Gln


Gly 


Pro 
170 


Glu 


Gly 


Leu Val 


Gly 
175 


Ala 


Ser 


Tyr 


Arg 


Thr 


Lys 
185 


Pro 


Pro 


Ala Ser 


Ile
190 


His 


Gly 


Val 


Cys 


Ser 


Asp 
200 


Pro 


Tyr 


Thr Leu 


Ala 
205 


Gln


Val 


Glu 


Leu 


Glu 


Arg 
215 


Leu 


Arg 


His Ile


Gly 
220 


Pro 


Gln


Ala 


Phe 


Val 


Asn 
230 


Lys 


Asp 


Pro Leu 


Ala 
235 


Ser 


Phe 


Ser 


Asp 


Lys 


Thr 
245 


Ser 


Asn 


Leu Glu 


Ala 

250 


Tyr 


Asn 


Arg 


Leu 


Cys 


Tyr 
260 


Leu 


Val 


Ala Thr 


Glu 
265 


Ile


Lys 


Lys 


Lys 


Gln


Arg 
275 


Ala 


Gln


Val Ile


Glu 
280 


Phe 


Ala 


Arg 


Glu 


Cys 


Phe 
290 


Asn 


Ile


Gly Asn 


Phe 
295 


Asn 



PCT/US00/19698 







300 


Ser Glu 


Asp 


His 






315 


Lys Leu 


Trp 


Glu 






330 


Trp His 


Pro 


Lys 






345 


Asp Gly 


Lys 


Tyr 






360 


Phe Gly 


Leu 


Pro 






375 



Thr 


Pro 


<jxy 


Cpr 
OCX 








15 


Glu 


Gln


Ala 


Gly








30 


Ser 


Leu 


Glu 


Thr 








45 


Tyr 


Pro 


Glu 


Lys 








60 


Leu 


Phe 


Ile


Glu 








75 


Cys 


Ile


Glu 


Gln








90 


Arg 


Val 


Arg 


Lys 








105 


Trp 


Thr 


Glu 


Thr 








120 


Gly His 


Leu 


Lys 








135 


Ala 


Tyr 


Arg 


Lys 








150 


Lys 


Leu 


Ala 


Ala 








165 


Asp 


Lys 


Pro 


Ile








180 


Arg Glu 


Leu 


Leu 








195 


Gln


Leu 


Thr 


His 








210 


Glu 


Glu 


Phe 


Val 








225 


Thr 


Lys 


Pro 


Cys 








240 


Val 


Lys 


Trp 


Phe 








255 


Cys 


Met 


Pro 


Ala 








270 


Phe 


Ile


Asp 


Val 








285 


Ser 


Leu 


Met 


Ala 








300 
Ile


Ile


Ser Gly 


Met 








305 


Thr 


Trp


Ala Lys 


Val 








320 


Gln


Met 


Asp Pro 


Thr 








335 


Arg 


Gly


Ala Ala 


His 








350 


Lys 


Ile


Val Ile


Pro 








365 


Phe 


Leu 


Asn Glu 


Gly








380 


Asn 


Phe 


Glu Lys


Phe 








395 


Ile


Thr 


Trp Lys 


Gln








410 


Ile


Thr 


His Tyr 


Leu 








425 


Leu 


Tyr 


Leu Ala 


Ser 








440 


Glu 


Lys 


Glu Arg 


Trp 








455 


Thr 









Asn 


Met 


Ser 


Pro 


Val 
310 


Arg 


Thr 


Ala 


Lys 


Phe 
325 


Gly 


Asn 


Phe 


Cys 


Asn 
340 


Arg 


Ser 


Leu 


Thr 


Ala 
355 


Phe 


Phe 


Ser 


Leu 


Leu 
370 


Cys 


Ala 


Asn Arg 


Leu 










385 


Leu 


Glu 


Leu Ala Lys 










400 


Val 


Glu 


Cys 


Pro 


Phe 
415 


Tyr 


Thr 


Ala 


Pro 


Ile
430 


Tyr 


Glu 


Ser 


Glu 


Ser 
445 


Lys 


Ala 


Leu 


Arg 


Ser 
460 



Ser 


Arg 


Leu 


Lys 


Lys 
315 


Phe 


Ile


Leu 


Glu 


His 
330 


Tyr 


Arg 


Thr 


Ala 


Leu 
345 


His 


Ser 


Ser 


Arg 


Glu 
360 


Ile


Lys 


Asp 


Ile


Tyr 
375 


Pro 


Asn 


Gly 


His 


Val 
390 


Gln


Val 


Gly 


Glu 


Phe 
405 


Glu 


Gln


Asp 


Ala 


Ser 
420 


Phe 


Ser 


Glu 


Asp 


Gly 
435 


Pro 


Glu 


Asn 


Gln


Thr 
450 


Ser 


Ile


Leu 


Gly 


Lys 
465 



<210> 67 

<211> 891 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1405545CB1 



<400> 67 

ggagaatggc ggcgccgggc tgcggctggg 
gctagagcga ctgagccgct ataatagcac 
gccgaagcag cagctgagga agctgctgta 
cccctacggg aggcaagacc ttcacctgcg 
cagggcggac aacatcttca cggccactga 
cgtccgtatc gaccacgccc cggaccttcc 
caatgttgga aaatcatctc taatcaaggc 
cagagtctcc aaaaaaccag gacacacaaa 
ttttacagtg gtggacatgc caggttatgg 
ggtagagacc tatctaaaag aacgaaggaa 
cgttgttgga attcaaaaaa cagacaatat 
accttatgtg attgtattaa caaaaattga 
agtgcttcag atccagaaat ttgttaacat 
tcctgtaagt gctgtgacct tttctggaat 
aacaggaagt cttgactaat ggttcccggt 



agcgggaaga ctctttgaaa tgcctgcggt 60 
gtcccaagct tttgctgagg tgctgcggct 120 
cccgctgcag gaagtagagc ggttcctcgc 180 
tatctttgac ccaagcccgg aggacatagc 240 
acggaaccgc atcgactacg tcagctccgc 300 
gcggccagag gtgtgtttta taggcagaag 360 
tttattttca ctggcccctg aggttgaagt 420
gaaaatgaat tttttcaaag ttggaaaaca 480 
ctttagagca cctgaagatt ttgttgacat 540 
cttgaagaga acatttttat tagtggatag 600 
tgccatagaa atgtgtgaag aatttgcatt 660 
caaatcttcc aagggacatc ttttaaaaca 720 
gaaaactcaa ggatgttttc ctcagttgtt 780 
ccacctgttg agatgcttta tagccagtgt 840 
ttagctgaag attcaaaaaa a 891 



<210> 68 

<211> 1512 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1451265CB1 



<400> 68 

gcccatggag gtggccgtgt gtacggactc 
ggaacttcac tcgggcgcca acctgctcac 
cctggcgctg ctcaatggcg agtatctgct 



ggcggccccg atgtggagct gcatcgtgtg 60 
ctaccgcggc ggccaggcgg gaccccgcgg 120 
ggcggcgcag ctgggcaaga attacatcag 180 
cgcctgggag ctccagcgga
cacctgtctg actgcatcac 
ccacctgtgg gaggtctcca 
cgtctcctgc cttcagttca 
cctggtgctg gtttggagcc 
gcccaggcac gtctggtctc 
gggccccctg gcccgggtgg 
ctcctcgggg gagctgctgc 
ggacctggct gagcaccata 
cctcttcacc tggcccggac 
ggtcttcaaa gggcacagga 
gctgctctca ggctcccacg 
catccggacg gtggccctca 
cagcatgctg agctcagact 
gctgggcgcc gagcacgggg 
ccaccagcag ggctcggagc 
gtgcagcacc atggagaaga 
ggagctggag gacgaggtgc 
cacgcgcttc atcacgcggc 
ggcctgagcc ccatgcctcc 
gcctggactc tcctcagttc 
tggtgtctcg tggcacgcgt 
aaaaaaaaaa aa 

<210> 69 
<211> 2536 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1556311CB1 

<400> 69 

caactcttgt tgaagctttt aggcgtcgca gactcttcat ttgtgagggc gacctctccc 60 
gaggggctct tttcacacaa atatccccac ggcgtttctc ggaggcaccc ccgtcatacg 120 
tcttgtctct cgcgacaatt ctctttgaag gcgaggcatt tcaccacaac tcttttcaac 180 
caacctggcg acaacaccca gagcttacat tgaccccaat ttgaattttc atcggcccaa 240 
ggctttcttt acactcaggg aactctcaca ctccttaggg ggaaaaaggc ttcgttaagg 300 
gccttgcaag ggttaccggg ttccggaatt ttcccggggg cccctcggct ggccaggact 360 
gaaacccaga cgagcatgcc agaaacagtc aaccataaca aacatgggaa cgtagctctc 420 
cctggaacga aaccaactcc catccctcca ccccggctga agaagcaggc ttcttttctg 480 
gaagcagagg gcggtgcaaa gaccttgagc ggcggccggc cgggcgcagg cccggagctg 540 
gagctgggca cagctggcag cccaggtggg gccccgcctg aggccgcccc gggggattgc 600 
acaagggccc cgccgcccag ctctgaatca cggcccccgt gccatggagg ccggcagcgg 660 
ctgagcgaca tgagcatttc tacttcctcc tccgactcgc tggagttcga ccggagcatg 720 
cctctgtttg gctacgaggc ggacaccaac agcagcctgg aggactacga gggggaaagt 780 
gaccaagaga ccatggcgcc ccccatcaag tccaaaaaga aaaggagcag ctccttcgtg 840 
ctgcccaagc tcgtcaagtc ccagctgcag aaggtgagcg gggtgttcag ctccttcatg 900 
accccggaga agcggatggt ccgcaggatc gccgagcttt cccgggacaa atgcacctac 960 
ttcgggtgct tagtgcagga ctacgtgagc ttcctgcagg agaacaagga gtgccacgtg 1020 
tccagcaccg acatgctgca gaccatccgg cagttcatga cccaggtcaa gaactatttg 1080 
tctcagagct cggagctgga cccccccatc gagtcgctga tccctgaaga ccaaatagat 1140 
gtggtgctgg aaaaagccat gcacaagtgc atcttgaagc ccctcaaggg gcacgtggag 1200 
gccatgctga aggactttca catggccgat ggctcatgga agcaactcaa ggagaacctg 1260 
cagcttgtgc ggcagaggaa tccgcaggag ctgggggtct tcgccccgac ccctgatttt 1320 
gtggatgtgg agaaaatcaa agtcaagttc atgaccatgc agaagatgta ttcgccggaa 1380 
aagaaggtca tgctgctgct gcgggtctgc aagctcattt acacggtcat ggagaacaac 1440 
tcagggagga tgtatggcgc tgatgacttc ttgccagtcc tgacctatgt catagcccag 1500 
tgtgacatgc ttgaattgga cactgaaatc gagtacatga tggagctcct agacccatcg 1560 
ctgttacatg gagaaggagg ctattacttg acaagcgcat atggagcact ttctctgata 1620 
aagaatttcc aagaagaaca agcagcgcga ctgctcagct cagaaaccag agacaccctg 1680 
aggcagtggc acaaacggag aaccaccaac cggaccatcc cctctgtgga cgacttccag 1740 



aggaccagct ccagcagaag 
ccaatggtct ctacgtcctg 
ccgggaacct tctggtcatc 
caggggacag cagccacttc 
tctgcagcgt gctgcaggcc 
accacacgct ccccatcacg 
ccacctcctc actggaccag 
tctccgtcct ctttgacgtg 
tgttctgcgg gggcagtgag 
agagggagag gagcttccac 
accaggtgac ttgcctgtca 
acgagaccgt gcgcctctgg 
aaggcccagt caccaatgcc 
tcaggcccag cctgccgctg 
acgagccgcg ccacgggggc 
ccagctacct ggaccgcacg 
gcgtgctcgg cggccaggac 
gcaacctgcg caagatcaat 
cggccaagtg aggcccggag 
cagcaaccag ggcccgcggg 
tgtgtcgtgt tcgggttttt 
cacagtggtg ctagtctgtt 



atcatgtgcc ccgggcctgt 240 
gcaggagttg cagaaagcat 300 
ctgagtcgac actaccagga 360 
atctcagggg gcaaggactg 420 
gacccctcca ggattccggc 480 
gacctgcact gcggctttgg 540 
acggtgaagc tatgggaggt 600 
tccatcatgg cagtgaccat 660 
ggctccatct tccaggtcga 720 
ccagagcagg acgccgggaa 780 
gtgtccactg acggcagcgt 840 
gacgtgcaga gcaagcagtg 900 
gccatcctgc tggcgcccgt 960 
ccccacttca acaagcacct 1020 
ctcactctgc gcctgggcct 1080 
gagcagctgc aggccgtcct 1140 
cagctgcgcg tccgtgtgac 1200 
cgggacctgt tcgacttctc 1260 
accccggccc gaggcgccca 1320 
tgtggccccc accagcccag 1380 
cctctgtgac tgggccgtct 1440 
tttaacaaaa gaggatgaaa 1500 

1512 
aattacctcc gagttgcatt tcaggaggtc
gtgagacctt acatcaccac tgaggatgtg 
ggggaccctg aggagtacag cctctttctc 
gaggacactt accctcaaaa aatcaaggcg 
ttccactttg tctacaaacg catcaagaac 
gaagaagacc tcaccacctc ctagaagaca 
ggagctggaa gccttgcctt cccgcttcta 
tcggggaccc ctcagtgtag tgactaagcc 
ttagccacgc aaggtagctg aggtttgtga 
attgcatctg atggttcaag tgtcctgaga 
aggttggctt acaggtatgt atatgtgcag 
attcaacagc agatgcttgc gatgcagtgc 
tcatccctgc cttccttcct ttctttttcc 
gccttcgggt tttaaa 



aacagtggtt gcacaggaaa gaccctcctt 1800 
tgtcagatct gcgctgagaa gttcaaggtg 1860 
ttcgttgacg agacatggca gcagctggca 1920 
gagctgcaca gccgaccaca gccccacatc 1980 
gatccttatg gcatcatttt ccagaacggg 2040 
ggcgggactt cccagtggtg catccaaagg 2100 
catgcttgag cttgaaaagc agtcacctcc 2160 
atccacaggc caactcggcc aagggcaact 2220 
aacagtagga ttctcttttg gcaatggaga 2280 
ttgtttgcta cctaccccca gtcaggttct 2340 
aagaaacact taagatacaa gttcttttga 2400 
gtcaggtgat tctcactcct gtggatggct 2460 
tttgtgtgtt tttttttttt ttttaaaaaa 2520 

2536 



<210> 70 

<211> 1415 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1901373CB1 



<400> 70 

gcgaggacgc gggccgagcc ggaagtggag 
gtggttcgag agcgcgcaga gtccagactg 
gtccctggtc tctccagccc tcactcggaa 
gctggacccc tctctacagc taggagccaa 
tggaccaagg gaagtatgat gctgatgaca 
cagtgggcaa atccaaactc atggagagat 
tgtccacgta cgccctgacc ctgtacaagc 
ttgtggactt ttgggacacg gcaggccagg 
accacaaggc ccacgcctgc atcatggtgt 
acctgagcac ctggtataca gagcttcggg 
tggccaataa aattgatgca gacataaacg 
agttctccct gcccctgtat ttcgtctcgg 
tcaatgatgc aattcgatta gctgtgtctt 
agatttttca ggagctcgag aacttcagct 
aggaacagag cagcagcatc gagaccccat 
gggctggggc taggggtggg tggagccctt 
agctctgaat ggagaaactc tctaggccat 
cctattagcc tcccacattc aaggcccgtg 
ctggactggt ggaagaattc cccaccagat 
attaacacct tccccacccc ctccccgcag 
ttatgtgtca ctttaataca ggaaatttag 
gttttctttc caaagctcac ctcggggaca 
taccccccca cccacagctg agtctgtgag 
cttttttcct cttcagtctc ccagtcatct 



tgcgctgcgg cgcgagctgg gccggcgggc 60 
gcggcagggc ccgaggggcc gacccgcagc 120 
ccgcactgac aataccctcc cctcccttgg 180 
tggcagaaga caaaaccaaa ccgagtgagt 240 
acgtgaagat catctgcctg ggagacagcg 300 
ttctcatgga tggctttcag ccacagcagc 360
acacagccac ggtagatgga aggaccatcc 420 
agcggttcca gagcatgcat gcctcctact 480 
ttgatgtaca gaggaaagtc acctatagga 540 
agttcaggcc agagatccca tgcatcgtgg 600 
tgacccaaaa aagcttcaat tttgccaaga 660 
ctgctgatgg taccaatgtt gtgaagctct 720 
acaaacagaa ctcccaggac ttcatggatg 780 
tggagcagga agaggaggac gtgccagacc 840 
cagaggaggt ggcctctccc cacagctgag 900 
ttaaaatacc cttcccttca acaactctcc 960
cccctcttct acctcctgca acccacccat 1020 
atacagggat gaggtcagca ccagcaaact 1080 
ctccttgaag cagaattagg gatcagcatc 1140 
gcagacagtg aagagaatca gaaaacatga 1200 
gtgttttttg gtgtttttgt ttttgttttt 1260 
attccttggg cttctcctga ggtaatgatt 1320 
gccccatcct ttccctacgt tttctcccat 1380 
ggttt 1415 



<210> 71 

<211> 1902 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2367767CB1 



<400> 71 

gcgaggtctg gctaggctac gggccacgcg 
tcggaggcgc gggcccgacg gaaaccatgt 
aggatctcat ccacgatgtc tctttcgact 



ccgccgccgc tgccgccgcc actgtcctct 60 
ttgtggctcg cagcatcgcg gcggaccaca 120
tccacgggcg gcggatggca acctgctcca 180 
gcgatcagag cgttaaggtc tgggataaaa gtgaaagtgg tgattggcat tgtactgcta 240
gctggaagac acatagtgga tctgtatggc gtgtgacatg ggcccatcct gaatttgggc 300 
aggttttggc ttcctgttct tttgaccgaa cagctgctgt atgggaagaa atagtaggag 360 
aatcaaatga taaactgcga ggacagagcc actgggttaa aaggacaact ctggtggata 420
gcagaacatc tgttactgat gtgaagtttg ctcccaagca catgggtctt atgttagcaa 480 
cctgttccgc agatggtata gtaagaatct atgaggcacc agatgttatg aatctcagcc 540 
agtggtcttt gcagcatgag atctcatgta agctaagctg tagttgtatt tcttggaacc 600 
cttcaagctc tcgtgctcat tcccccatga tcgccgtagg aagtgatgac agtagcccca 660 
acgcaatggc caaggttcag atttttgaat ataatgaaaa caccaggaaa tatgcaaaag 720 
ctgaaactct tatgacagtc actgatcctg ttcatgatat tgcattcgct ccaaatttgg 780 
gaagatcttt ccatattcta gcaatagcga ccaaagatgt gagaattttt acattaaagc 840 
ctgtgaggaa agaactgact tcctctggtg ggccaacaaa gtttgaaatc catatagtgg 900 
ctcagttcga taatcataat tctcaggtct ggcgagtgag ttggaatata acaggaacgg 960 
tgctagcatc ttcaggagat gatgggtgtg taagattgtg gaaagctaat tatatggaca 1020 
attggaagtg tactggtatt ttgaaaggta atgggagccc agtcaatggg agttctcagc 1080 
agggaacctc aaatccttcc ctaggttcaa atattccaag tcttcagaat tcattaaatg 1140 
gatcttctgc tggcagaaag cacagctgag tacaagctaa ctggagtaac tttgctgttt 1200 
tgctgcttgt tgcatgcaca caggaatgga aagcgagctc cttttcccct tccccagcgc 1260 
cgtttgacct ctcccaagat acaccagcag cctgcttact actaaacgca atccaaaagg 1320 
cctttaaaaa tacagtgtat attttttgta ctagtcagtt tattgacact atttgaaact 1380 
tttgaaatat aaacggagag gctttctgtt gagacattgt caccaaaaca attttttgaa 1440 
atgttcctga aactaatttg ggtttaaaga ttaaaagggt tgttaccatt cttatctgag 1500
tagttgggag gaggggaata ccactttagt tcatttggaa aatatagaca tatttctttt 1560 
gctttcttaa aacagcttaa aatgatgaac ttttataatt ttaatttgaa gattgaataa 1620 
atatttttta taaagattgt tttgagtgct gatttgttta ctttttgtag atttgcttta 1680 
tccatgatat tcagtacaac tctgtcattt ctttgtaata tttaaaaaat attagtaaag 1740 
gagtgaatta ataaagtagt aatagtaaaa tgaaaggaac ttgactgtac agtttgtagc 1800 
caggttaagc atttggtatt gtttcattta caatttggga ctaagatgga aacacttttt 1860 
ttataagttt ttaattcata gtcactaaag agataaatgt tt 1902 

<210> 72 

<211> 1681 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3090433CB1 

<400> 72 

gggcagggct ttgctatggc taatgatccc ttggaaggct tccatgaagt aaaccttgct 60 

tcacctactt ctccggacct tcttggtgtg tatgaatcag gaactcaaga gcagactacc 120 

tcaccaagtg tcatctaccg gccacaccct tcagctttat cctctgtacc tatccaggca 180 

aatgcattag atgtttctga acttcctaca caacccgtgt attcatcccc cagacgttta 240 

aattgtgcgg aaatatctag tatcagcttt catgttacag acccagcccc ttgctctacc 300 

tctggagtca cagctggatt aactaaatta actacaagaa aggacaacta taatgcagag 360 

agagagtttt tacagggtgc tactataaca gaggcttgcg atggcagtga tgatattttt 420 

gggttgagta ctgatagtct gtctcgttta cgaagcccat ctgttttgga agttagagaa 480 

aagggctatg aacgattaaa agaagaactc gcaaaagctc agagggaact gaagttaaaa 540 

gatgaagaat gtgagaggct ttcaaaagtg cgagatcaac ttggacagga attggaagaa 600 

ctcacagcta gtctatttga ggaagctcat aaaatggtga gagaagcaaa tatcaagcag 660 

gcaacagcag aaaaacagct aaaagaagca caaggaaaaa ttgatgtact tcaagctgaa 720 

gtagctgcat tgaagacact tgtattgtcc agttctccaa catcacctac gcaggagcct 780 

ttgccaggtg gaaagacacc ttttaaaaag gggcatacaa gaaataaaag cacaagcagt 840 

gctatgagtg gcagtcatca ggacctcagt gtgatacagc caattgtaaa agactgcaaa 900 

gaggctgact tatccttgta taatgaattc cgattgtgga aggatgagcc cacaatggac 960 

aggacgtgtc ctttcttaga caaaatctac caggaagata tctttccatg tttaacattc 1020

tcaaaaagtg agttggcttc agctgttctg gaggctgtgg aaaacaatac tctaagcatt 1080 

gaaccagtgg gattacaacc tatccggttt gtgaaagctt ctgcagttga atgcggagga 1140 

ccaaaaaaat gtgctctcac tggccagagt aagtcctgta aacacagaat taaattaggg 1200 

gactcaagca actattatta tatttctcct ttttgcagat acaggatcac ttctgtatgt 1260 

aactttttta catacattcg atacattcag cagggactcg tgaaacagca ggatgttgat 1320 

cagatgtttt gggaggttat gcagttgaga aaagagatgt cattggcaaa gctgggttat 1380 
ttcaaagagg aactctgatg ctctgcgtgg
aaatggctga atatttttat ggttacttga 
tttttcccct tttgcaaatt gctctaagaa 
ctgtgtttgc ttattcttta gttgaacaca 
gtaatttata gttgccaaaa aaaaaacaaa 
a 



gaccatgcct gaactccccg aataactgaa 1440 
tatttatttc caaggagtga gcctaagact 1500 
gtaccatgat ttcttttaaa ctgatctatg 1560
ctatgaagaa ttccaggtgt actagtgaat 1620 
cctgaaataa ataaatgtta gattgaaaaa 1680 

1681 



<210> 73 

<211> 1378 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3800591CB1 

<400> 73 

ggcagatcct atctggcgca tgcgaacgct 
gactcaaggc tgttgcttcc gcccttactc 
ctgggctggg ggaagaggcg tggcggcgct 
gggggcgctg cggcacagct ggtttgagca 
acgcagacac tgaatggaat gacatcttac 
gtctgaaaga attggaagag gaggcagaag 
tgaaaacata tgaagatatg actttggaag 
aggaggatga acgtgctatt gaaatgtaca 
ctaaactgaa gaataaattc ggagaagttt 
aagttaccaa agctggcgag ggcttgtggg 
ccctctgtgc cctgataaat cagcacctca 
aatttatcaa agccatttca acaacctgca 
cgatatttgt ttacctggaa ggagatatca 
gcggcatgaa cctgacaaga gatgagttgg 
tgacagacct ggaggaaaac cctaagaagc 
ggcgctctgt cctcatgaag agggacagcg 
atcacatgcc gaactttctt gtgacaaatt 
gaatgaatcc ttgtggtttt tagttttgta 
aaataatcat tgctggagat tctgttaaat 
gtatttcctc taaaaaaaat taaaaccagc 
tcaatattac aattcaggtt tcctaatttt 
cgatcttagg aaggatcact tttggctgtt 
ttaaaaatgt tttgtcataa tttagttaat 

<210> 74 

<211> 1444 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5308471CB1 



tctgtgccga ttccttgaag agcaggcgca 60 
cccgccgctc gtccctgggc ggggcgaagg 120 
gtgcgcgtgc acaaaagaga gctgaggggc 180 
actgaactgg aaacaagatg caggacccca 240 
gcaaaaaggg tatcttaccc cccaaggaaa 300 
aggagcagcg catcctccag cagtcagtgg 360 
agctggagga tcatgaagac gagtttaatg 420 
gacggcggag actggctgag tggaaagcaa 480 
tggagatctc agggaaggat tatgttcaag 540 
tcatcttgca cctttacaaa caaggaattc 600 
gtggacttgc caggaagttt cctgatgtca 660 
tacccaatta tcctgatagg aatctgccca 720 
aggctcagtt tattggtcct ctggtgtttg 780 
aatggaaact gtctgaatct ggagcaatta 840 
cgattgaaga cgtgttgctg tcctcagtgc 900 
attccgaggg tgactgaggc tacagcttct 960 
gtctggattt tttaaaaaag gaaaaagcaa 1020 
taaattatgt ttcaaatctt tacattttgg 1080 
attttggaac tctttttttt ttaaattata 1140 
catttgtatg gcaaaaaaaa aaaagatact 1200 
ctaaaaccta tgggaatttt ctaggatgga 1260 
gtgagaaaca caaaataatt ttattacact 1320 
attaaccttg tttaacttta tagaaaga 1378 



<400> 74 

gcacgcagtg ccggaggccg cagcgccgga 
gaggtcagct gcgggagcgt ttccggggac 
ctgtgcagcg ccgcccaggc cgcctggcgg 
gcgcgctggt tcccgggcca catggccaag 
ctggtggact gtatcatcga ggtccacgat 
ctgtttcagg aaacccttgg gcttaagcct 
gcggatctta cagagcagca gaaaattatg 
gtcattttta ccaactgtgt aaaggatgaa 
gaactgattg ggagaagcca ccgctaccac 
gtcattgggg tccccaacgt gggcaagtcc 
ctcaggaaag ggaaagccac cagggtgggt 
tccaaaattc aggtctctga gcggcccctg 



acctcagagg cgggtcgcag cggcgcagag 60 
ggtgccgcca tgagattgac cccgcgcgcg 120 
gagaacttcc ccctgtgcgg tcgcgacgtg 180 
gggctgaaga agatgcagag cagcctgaag 240 
gcccggatcc cactttcagg ccgcaaccct 300 
cacttgctgg tcctcaacaa gatggacttg 360 
caacacttag aaggagaagg cctaaaaaat 420 
aatgtcaagc agatcatccc gatggtcact 480 
cgaaaagaga acctggagta ctgtatcatg 540 
tccctcatca actccctccg gaggcagcac 600 
ggcgagcctg ggatcaccag agctgtgatg 660 
atgttcctgt tggacactcc tggcgtgctg 720 
gctcctcgga ttgaaagtgt ggagacaggc
gaccacctgg tcggggagga gaccatggct 
cagcgctttg ggtacgtgca gcactacggc 
gtgctgaaga gtgtggctgt gaagctgggg 
acgggtaacg tgaacgttat tcagcctaac 
actttccgcc gtgggctgct gggttccgtg 
cccccggctg agactttgcc ctgaacttgt 
tcccagacct cctgacctgg gtggttgagg 
catgctggtc actagggtgc tgtgctctct 
ccccagttgc agggcccaag caggtgggag 
gagcagctcc gcatgcttgg ttctcccgga 
tgctgtctca gaaggagtta aagctataac 
aaaa 



ctgaagctgg ccctgtgtgg aacggtgctg 780 
gactacctgc tgtacaccct caacaaacac 840 
ctgggcagtg cctgtgacaa cgtagagcgc 900 
aagacgcaga aggtgaaggt gctcacgggc 960 
tatcctgcgg cagcccgtga cttcctgcag 1020 
atgctggacc tcgacgtcct gcggggccac 1080 
ccgggtaggg agggccggag gcatgtggcc 1140 
ctcaagacag ctcacccggt ccagaagctc 1200 
ggcgccccac agcctggcca gctccaggga 1260 
tggacaccag gcttcccagt ggacgtccct 1320 
gcttcctgct caggcctctt gagaaatgga 1380 
ctgtaacctt taaaatctca aaaaaaaaaa 1440 

1444 



<210> 75 

<211> 2067 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5324322CB1 



<400> 75 

ggcactgtcc accagcactc agagctccat 
gggactgggg tgtcccccag cctcagcaga 
tagcgccaga gaagcagctc agagcaaggg 
gtcatggggg ggctgcagta ggggtggtca 
gccatgttct cctcagacag aaactgcttg 
gaggcaggct gggatactga cacaggaggc 
cttgttggga gtccctccct gcccccaaac 
gctcatgggc acagctcctc ggagagggga 
ggcggcccgc gggaggcgct gggcccctcc 
gatcattggg ttttggggtc ctgcgggtgg 
ggcgtggcca agagaggatg ggcgtggctt 
ggcgcggaag acgggcggcg cgtggcggaa 
tatccggctt aagggggctg cggtggacac 
ctcacctcgg ctcctagggt tcgggacggt 
tagggcgcca cggagaggaa ccgctctagg 
gcggagcact ctgggacttg tagttctgga 
tctgggtcag acagaggtgt tccaggcctt 
gagcgtctca ccatgtggga agtttctggc 
cttcagcttg tcctctgctt tgagctcaga 
gactttccaa gcccatgatg ggcccgtcta 
tagtgctggg gatggggagg tgaaggcctg 
taaggagctg tggcgtcgtc agcctccata 
cgctttgctg ctggtcccca aggagaattc 
gcacactatg gaccttgaaa ctgggacttt 
catccactgc ctggcactgc gggaaaggag 
agctgttcga ctttgggacc tgcgcacagc 
gcacgaggag tgctcgaggc cccacaatgg 
cgactggatg gtctgtggag ggggcccagc 
acccaccacc atcttcccca tccgggcgcc 
gattctgtca gctggccagg gccgctgcgt 
ggcccaggtg cctggctcct ccccagggct 
cgcgcctgag tgcaaggtcc tgacagctgc 
caacctgggt taccgagcct tctccctgtc 
ctcagggttt tagagtgttt ttcattttct 
ggctttttta ccaaaaaaaa aaaaaaa 

<210> 76 
<211> 2085 
<212> DNA 



tatgtcccca gctggggttg cagggtaggg 60 
cggagggcct cagggatgag gctgccagga 120 
ctcctgagtg ggggcagggc tggggagaag 180 
ttgtgcaggc tgagttgaga gaagtgggtg 240 
cgcagaggct ccctggggag agatggcaga 300 
agcctgttgg ggaccagagg tgacagagat 360 
tcactgctcc tcctccaggc gccgcttggt 420 
gctggcgtcc aggccccaag tcacccccaa 480 
ctgggggcct cgctgcaagg gctgctgcag 540 
gatctgggcg acaggggagg agtctctgag 600 
taggcgggca cagccgcgag gttctgcgcg 660 
ggcaggcttg ctcctcgggg tgggggaggg 720 
cacttcttaa tgtcgggggt cttcgcggcg 780 
acgcaccagc caccttcgcg ccgaaggcgg 840 
cacgtaaggc ctcgtgaggt tgcgtcgcgc 900 
gatggagcga gctgtgccgc tcgcggtgcc 960 
gcagcggctc catatgacca tcttctccca 1020 
ggctggcaac aattacgggc agattgccat 1080 
agccaaagag gaaagtaaga agccggtggt 1140 
tagcatggtt tccaccgatc gacatctgct 1200 
gctttgggcg gagatgctca agaagggctg 1260 
caggaccagc ctggaagtgc ctgagatcaa 1320 
cctcatcctg gctgggggag actgtcagtt 1380 
cacgagggtc ctccggggcc acacagacta 1440 
cccagaggtg ctgtcaggtg gcgaggatgg 1500 
caaggaggtc cagacgatcg aggtctataa 1560 
gcgctggatt ggatgtttgg caactgattc 1620 
cctcaccctc tggcacctcc gatcctccac 1680 
acagaagcac gtcaccttct accaggacct 1740 
caaccagtgg cagctgagcg gggagctgaa 1800 
gctcagcctc agcctcaacc agcagcctgc 1860 
aggcaacagc tgccgggtgg atgtcttcac 1920 
cttctgatct ctgacgacac ccccagccag 1980 
tttttttttt ttttttacaa taaagtttca 2040 

2067 
<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 067184CB1 

<400> 76 

gtgttgcgcg actggccttg agggagagct ggggcctgct cccggagaga tacggctatg 60 
tcgatcgaaa tcgaatcttc ggatgtgatc cgccttatta tgcagtactt gaaggagaac 120
agtttacatc gggcgttagc caccttgcag gaggagacta ctgtgtctct gaatactgtg 180 
gacagcattg agagttttgt ggctgacatt aacagtggcc attgggatac tgtgttgcag 240 
gctatacagt ctctgaaatt gccagacaaa accctcattg acctctatga acaggttgtt 300 
ctggaattga tagagctccg tgaattgggt gctgccaggt cacttttgag acagactgat 360
cccatgatca tgttaaaaca aacacagcca gagcgatata ttcatctgga gaaccttttg 420 
gccaggtctt actttgatcc tcgtgaggca tacccagatg gaagtagcaa agaaaagaga 480 
agagcagcaa ttgcccaggc cttagctggc gaagtcagtg tggtgcctcc atctcgtctc 540 
atggcattgc tgggacaggc actgaagtgg cagcagcatc agggattgct tcctcctggt 600 
atgaccatag atttgtttcg aggcaaggca gctgtcaaag atgtggaaga agaaaagttt 660 
cctacacaac tgagcaggca tattaagttt ggtcagaaat cacatgtgga gtgtgctcga 720 
ttttctccag atggtcagta tttggtcact gggtctgttg atggattcat tgaagtatgg 780 
aactttacta ctggaaaaat cagaaaggat cttaagtacc aggcccaaga taactttatg 840 
atgatggatg atgctgtcct ctgcatgtgt ttcagcagag atacagaaat gttagcaact 900 
ggggcccaag atggaaaaat caaggtgtgg aagattcaga gtggacaatg tttaaggaga 960 
tttgagaggg cacacagtaa gggtgtcacc tgtctaagct tttctaagga tagcagtcag 1020 
atccttagtg cttcttttga ccagacaatt agaattcatg gtttaaaatc tgggaaaacc 1080 
ctgaaggaat ttcgtggcca ttcctccttt gttaacgaag caacatttac acaagatgga 1140 
cattacatta ttagtgcatc ctctgatggc actgtaaaga tctggaatat gaagaccaca 1200 
gaatgttcaa atacctttaa atccctgggc agcaccgcag ggacagatat taccgtcaac 1260
agtgtgattc tacttcctaa aaaccctgag cactttgtgg tgtgcaacag atcaaacacg 1320 
gtggtcatca tgaacatgca ggggcagatt gtcagaagct tcagttctgg taaaagagaa 1380
ggtggggact ttgtttgctg tgccctctct ccccgtggtg aatggatcta ctgtgtaggg 1440 
gaggactttg tgctctactg tttcagtaca gtcactggca aactggagag aactttgaca 1500 
gtgcacgaga aggatgtgat tggtattgca catcaccctc atcagaacct gattgctacc 1560 
tacagtgaag atggactcct aaagctctgg aaaccataat tcaacttttc tttttaaatc 1620 
agctcgaaag catgtactta aatgaagcat attcatgtaa tgtgcttttt tttttttttt 1680 
gccagctttt ctaagcaaat agattgtctg aattagtcac agaataattt tgtgaaaatt 1740 
catgtttaag tagcaactac cctttctttt tttatatatt tttaaggtat tagtttatct 1800 
tcttctaact ggtgcagtca cttaatgttt tcattaatct tcgacctgga gagtgaaata 1860 
ctgatatttc tagaaaaaaa ttctactcct ctgattattt gaaatgctga ggaaaatgtc 1920 
cctcccatag taaaacttgt aaataaggaa ctatatcata ttcagtagct gtgttctgtt 1980 
ccatcttttt tttttttttt gagatggagt tttgcttgtt gcccaggctg gagtgcagcg 2040 
gcacgatctt ggttcactgc aacctccgcc tcccaggttc aagcg 2085

<210> 77 

<211> 2061 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 722896CB1 

<400> 77 

cgagccgcgg gacccgggcc gtaccgggga ggggccgctc cgggccgcag cgcgagggca 60 
gcgaggggcg gcggggacct ggcaccgggc ggggccggcg gcagcgacca tgatcgcttt 120
gttcaacaag ctgctggact ggttcaaggc cctattctgg aaggaggaga tggagctcac 180 
gctggtcggg cttcagtact cgggcaagac caccttcgtc aacgtgatcg cgtcaggaca 240 
gttcaacgag gacatgatcc ccaccgtggg tttcaacatg cgcaaaatca ccaaagggaa 300 
tgtgactatc aagctctggg acattggggg acagccgcgt ttccgcagca tgtgggagcg 360 
ctactgccga ggagtgagcg ccatcgtgta catggtggat gctgctgacc aggagaagat 420 
tgaggcctct aagaacgagc tccacaacct actggacaaa cctcagctgc agggcatccc 480 
ggtcttagtc ctgggtaaca agcgagacct tccgggagca ttggatgaga aggagctgat 540 
tgagaaaatg aatctgtctg ccatccagga ccgagagatc tgctgctact ccatctcttg 600 
caaagaaaag gacaacattg acatcaccct acagtggctt attcaacact cgaagtcacg 660
gagaagctga gactccagcc cttctccctc agaccaggga ccgtcatcat ctaaacctga 720 
agccgagctc cccgcccacc cctgtcgtcc ccctaagccc acccctcctc acccagtgtg 780 
aggagggccc tctggggacc ccagagtcct gttctgctga ggtttgaact cctgttttta 840 
ttgtaaaata aattgccccc cattctggtc ccctaacttc tcacccttcc ccgctgcctt 900 
tgtcccatca cccagccctg cctccctccc agcagccctg ggccacagcc cccgcccctg 960
gcttttcccc ggcccggtct tgtacctccc ttttcaacac tctctgttat tgtcctgtgt 1020 
gtacagtata tatatgtata tatattttaa ttttttaatt taagcaaaga ctaaaatcaa 1080 
ccatttgatg ctgcaggggc ctttcaggat ctgggagggg gcagtctgga gagaaggagg 1140 
gagacgcagg tggacttggg gcaagttcag atcagaagag gtgcaggctg gcacctgcgg 1200 
caggtaccag cctgggcact ggtggccgcc tccctgtccc gtgtgtttcc accgcccaat 1260
ctggcttgtc ctggcagtgc ttgaatgcca caggctggca ggggcctctg ggggcccctc 1320 
ccctcgaccc ccagcctggg tagagccacc aggtacgacg accaggtacc agaaaccacc 1380 
aggcacacgg ggcagaaagc cagcgtccat gccccagcag ccccctcctg cctgttcctg 1440 
gctcccagct cccgcccctc cccagggccc ccacctccac ggcccacttc attttctgtt 1500 
ctcattttgc agagttgcac aaggagagaa ctcagcatgg ggggttggtt ctttgggttc 1560 
tgtttgttta tttgtttaat ttaatgattt gtaaagtgat gttcctcttc cttttttaca 1620 
cttttcagct catatttaac ctctgtttgg aaaatgattc ttgtaactgt acattttttt 1680 
gcttcctaat aacaatgaca acaaaaaaaa taaatgacca gttttgtgtt ggggggggtg 1740 
tatggtgctg gttacttttc cgcagttggc atgggttgcc ctacaggccc acagggccac 1800 
cagcacaccc ccgcacgctg ggcaccaaca gagccacgga gcgcgagcac atgcccgccc 1860 
ggggagcaca atggcgctgc acaaaacggc ctcccacacg tgcgtccagg ctcttgcgcc 1920 
acctccttct cattctcttt tcagactttc atgtagtccc agctttgagc cagcagctgc 1980 
cacttggggc tgcagcgctc tgttgaggga actgcccagg gctgggtaga ggcagcaagg 2040
ggacagggct gggtgctgtt t 2061 

<210> 78 

<211> 981 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1571739CB1 

<400> 78 

gtagttccag aaaataggac 
tgctgtcttg ggtggtgaag 
gcgattcatt ggagaatatg 
ggaaaggaaa caactaaatc 
attctccctc acaagtgagc 
tgataggtct tcatttgctt 
tagtcattgt aaaagagctg 
ttgtcatgtg cgagaggttg 
ccaattctgt gaactgtctg 
aattatcaag gacatcctga 
taaatcaatg gccaaattga 
gacaggtaat cctgggagat 
aactaacctt tgtatgcaat 
attgagtatg aaccacagct 
cctgctcact ctccttcaca 
acctactggc ggacagcgat 
tattaaacga caaaaaaaaa 

<210> 79 
<211> 1375 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1739479CB1 



tgaccaagaa gcagaaaagc 
gaacaggcaa atctgccctt 
cttctaattt tgaatctatc 
tagaaatata tgacccttgt 
ttcactgggc agatgggttt 
ttgcaaaagc gctgatctac 
tggaatcagc agtgtttttg 
gctgggaaga agggcaaaag 
cagcagagca gtctctggag 
taaacttcaa actcaaagaa 
tcaataatgt atttggaaag 
ttcctatatc agagagtttc 
ttttttttgg taaaaagaat 
gtgttttcaa atatgtagtt 
cagaaccttt catttattgt 
cccagtttgc cttgccaaat 
a 



aagatgaatg atgtgaagct 60 
acagtgaggt ttcttactaa 120 
tataagaagc acttgtgttt 180 
tctcaaacac agaaagcaaa 240 
gttattgtgt atgacatcag 300 
agaatccggg agccacaaac 360
gttggcaaca aacgagatct 420 
ctggcactgg aaaaccgatg 480 
gtggaaatga tgtttatcag 540 
aagagacgtc ccagtggatc 600 
agaaggaaat ctgtttagta 660 
aaacattcac atgataatta 720 
tctcttggag atatgaaatg 780 
tgcctttttg gttgttgtac 840 
acaacatcac actcacccta 900 
aaactctgtt tatgtgaatt 960 

981 
<400> 79

aattctgtgc ttctccctct tgggccttca cacacttatg cttatgtaaa taattattaa 60 
tcatattttc atgatggtgg agattcttat tcaccactac atcccatccc cagggcctgc 120 
cacagaatag gcattcagta aatatttttt gaatgattga ctgagaaatc acacctctgt 180 
ttcttttaaa cacatcctga tagctccata agtttcatca gggtcagtgg tttccattgt 240
cctgactgct ggccacagtg acctgttctg tgctttattt gacaagaccc ctgaatggtt 300 
ggacagtgat tcctgccaaa agtgtgatca gcctttcttc tggaacttca agcaaatgtg 360 
ggacagtaag aaaattggtc taagacagca ccactgccgc aagtgtggga aggccgtctg 420 
tggcaagtgc agctccaagc gctcctccat ccccctgatg ggcttcgagt ttgaagtgag 480 
ggtctgtgac agctgccacg aggccatcac agatgaagaa cgtgcaccca cagccacctt 540 
ccatgacagt aaacataaca ttgtgcatgt gcatttcgat gcaaccagag gatggttact 600 
gacttctgga actgacaagg ttattaagtt gtgggatatg accccagtcg tgtcttgatg 660 
actctcccag gaatcagaaa gatagtattt actaaagaaa cggttgtttt aacccaaatc 720 
attaccagag tggtaaagca gacatgtgag aagtaagaaa gaaactaaag accctgaatg 780 
aatttgcaga ttacccatgt gcacagtggg gacctggcca gtgagcactc gcaaggggac 840 
tcttccaact tgttcataca atataaaaga agctattttt ttaacaaatg gtttatacag 900 
tctggctgtg ctgcattgtt ttgagtgtac cgaaaaatct gtgtggggtg tttaattttt 960 
atacttttca acaccccatt ttatttgttg ctttgtcaga gaaataaggg aggtatctac 1020 
tcagagtatt ttggtcatta tactttctgt gtttacttca acatgtgtca cgtggccagc 1080 
ggctttttct tctcttccct ctgcacctac ctgcaccttc tctgcctttc ctggagggga 1140 
tgtatttatg ttatttattc ccagtgtttc tgctttcatg tcctcctcag tggagagatt 1200 
tggaaactca tcatgtggat tcaccagcca gctgctggaa ttgcctgaag agcgatttgt 1260
ttgtaatgtc tgcctcattc acgttcttat gaagtagaaa agactgtgtt tctgcctcag 1320 
ttgcctctgt ctttcccaca ttaaaaaaaa aaatgctgtg agaaaaaaaa aaaaa 1375 

<210> 80 
<211> 2833 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1999147CB1 

<400> 80 

cggttgggac gcacacactc tgcgtcatgg agggctgagg ccgatgatga attccggagt 60 
gcctgtcagg cttgctgtgt cactcggccc gctcggcgcg ccccttccca gccgcccttc 120 
cgtaccggct ctcgggctct tccggtctcc ggccgcccct tacctgcagg ctcttctccc 180 
gccgcggccc ggcgctctcc gagtcgcccc tgcggactgg tctcgcacag tgcctgggca 240 
ccgggcgcca gacagacact ggccatgacg agcggcgcaa ccaggtaccg gctgagctgc 300 
tcgctccggg gccacgagct ggacgtacgg ggcctggtgt gctgcgccta tccgccggga 360 
gcctttgtgt ccgtgtcccg agaccgcacc acccgcctct gggccccaga cagtccaaac 420 
aggagcttta cagaaatgca ctgtatgagt ggccattcca attttgtatc ttgtgtatgc 480 
atcataccct caagtgacat ctaccctcat ggcctaattg ccaccggtgg aaatgaccac 540 
aatatatgca ttttctcact ggacagtcca atgccacttt atattctaaa aggccacaaa 600 
aatactgttt gtagtctatc atctggaaaa tttgggacat tacttagtgg ttcatgggac 660 
accactgcta aagtctggct gaatgacaag tgcatgatga ccttgcaggg tcatacagct 720 
gcagtgtggg cggtaaagat cttacctgaa cagggcttaa tgttgactgg atcagcagac 780
aagactgtta aactgtggaa ggctggaaga tgtgagagga ctttttcagg gcatgaagac 840 
tgtgtaagag gtttggcaat tttgagtgaa acagaatttc tttcctgtgc aaatgatgct 900 
agtattagaa ggtggcaaat cactggcgag tgtcttgaag tatattatgg acatacaaat 960 
tatatttata gcatatccgt ttttccaaat tgtagagact ttgtgacaac agcagaggac 1020 
agatctctga gaatctggaa acatggggaa tgtgctcaaa ctatccgact tccagctcag 1080 
tctatatggt gctgctgtgt gctcgacaat ggtgacattg tggttggtgc gagtgatggc 1140 
attattagag tgtttacaga atcagaagat cgaacagcaa gtgctgaaga aatcaaggct 1200 
tttgaaaaag aactgtctca cgcaaccatt gattctaaaa ctggcgattt aggggacatc 1260 
aatgctgagc agcttcctgg gagggaacat cttaatgaac ctggtactag agaaggacag 1320 
actcgtctaa tcagagatgg ggagaaagtc gaagcctatc agtggagtgt tagtgaaggg 1380 
aggtggataa aaattggtga tgttgttggc tcatctggtg ctaatcagca aacatctgga 1440 
aaagttttat atgaagggaa agaatttgat tatgttttct caattgatgt caatgaaggt 1500 
ggaccatcat ataaattgcc atataatacc agtgatgacc cttggttaac tgcatacaac 1560 
ttcttacaga agaatgattt gaatcctatg tttctggatc aagtagctaa atttattatt 1620 
gataacacaa aaggtcaaat gttgggactt gggaatccca gcttttcaga tccatttaca 1680
ggtggtggtc ggtatgttcc gggctcttcg ggatcttcta acacactacc cacagcagat 1740
ccttttacag gtgctggtcg ttatgtacca ggttctgcaa gtatgggaac taccatggcc 1800
ggagttgatc catttacagg gaatagtgcc taccgatcag ctgcatctaa aacaatgaat 1860
atttatttcc ctaaaaaaga ggctgtcaca tttgaccaag caaaccctac acaaatatta 1920
ggtaaactga aggaacttaa tggaactgca cctgaagaga agaagttaac tgaggatgac 1980
ttgatacttc ttgagaagat actgtctcta atatgtaata gttcttcaga aaaacccaca 2040
gtccagcaac ttcagatttt gtggaaagct attaactgtc ctgaagatat tgtctttcct 2100
gcacttgaca ttcttcggtt gtcaattaaa caccccagtg tgaatgagaa cttctgcaat 2160
gaaaaggaag gggctcagtt cagcagtcat cttatcaatc ttctgaaccc taaaggaaag 2220
ccagcaaacc agctgcttgc tctcaggact ttttgcaatt gttttgttgg ccaggcagga 2280
caaaaactca tgatgtccca gagggaatca ctgatgtccc atgcaataga actgaaatca 2340
gggagcaata agaacattca cattgctctg gctacattgg ccctgaacta ttctgtttgt 2400
tttcataaag accataacat tgaagggaaa gcccaatgtt tgtcactaat tagcacaatc 2460
ttggaagtag tacaagacct agaagccact tttagacttc ttgtggctct tggaacactt 2520
atcagtgatg attcaaatgc tgtacaatta gccaagtctt taggtgttga ttctcaaata 2580
aaaaagtatt cctcagtatc agaaccagct aaagtaagtg aatgctgtag atttatccta 2640
aatttgctgt agcagtgggg aagagggacg gatattttta attgattagt gtttttttcc 2700
tcacatttga catgactgat aacagataat taaaaaaaga gaatacggtg gattaagtaa 2760
aattttacat cttgtaaagt ggtggggagg ggaaacagaa ataaaatttt tgcactgctg 2820
aaaaaaaaaa aaa 2833



<210> 81 

<211> 1752

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2182085CB1 



<400> 81 

gcaggcagcc atcttgcctg gagcctgaga aagggaggag agacagaagg aaccggcgac 60
agtggtctca gggccgctcc ggggggcctc aagaaccgga ggcagccccg gaggctgccg 120
cgggcggaca cgccagagga ggaggccggg gaatggccgc ggtgtggcag caagtcttag 180
cagtggacgc gaggtacaac gcgtaccgca caccaacgtt tccacagttt cggacgcagt 240
atatccgccg gcgcagcagc tgctgcggga gaatgccaag gctgggcacc ccccagcgct 300
gcgtcggcag tacctgaggc ttcgggggca gctgctgggc cagcgctacg ggcccctctc 360
cgagccaggc agtgctcgtg cctatagcaa cagcatcgtc cgcagtagcc gcactactct 420
tgaccgcatg gaggactttg aggatgatcc tcgggccctg ggggcccgtg ggcaccgtcg 480
ttctgtcagc agaggctcct accagctgca ggcgcagatg aaccgtgccg tctatgagga 540
caggccccct ggcagcgtgg tgcccacgtc agcagcagag gcaagtcggg ccatggccgg 600
ggacacgtca ctgagcgaga actatgcctt tgcgggcatg tatcatgttt ttgaccagca 660
cgtggatgag gcagtcccaa gggtgcgctt cgccaatgat gaccgacacc gcctggcctg 720
ctgctcactc gacggcagca tctccctgtg ccagctggtg cctgccccac ccacagtgct 780
tcgcgtgcta cggggccaca cccgtggtgt ctccgacttc gcctggtccc tctccaatga 840
catcctcgtg tccacctcac tggatgccac catgcgcatc tgggcctctg aggatggtcg 900
ctgcatccga gagatccctg accccgatag cgctgaactg ctctgctgca ccttccagcc 960
tgtcaacaac aacctcactg tggtggggaa cgccaagcac aacgtgcatg tcatgaacat 1020
ctccacaggc aagaaagtga aggggggctc cagcaagctg acaggccgtg tccttgctct 1080
gtcctttgat gcccctggcc ggctgctctg ggcgggtgat gaccgtggca gtgtcttctc 1140
tttcctcttt gatatggcca cagggaagct gaccaaagcc aagcgtttgg tggtgcatga 1200
ggggagccct gtgaccagca tctcagcccg gtcctgggtc agccgcgagg cccgggatcc 1260
ctcactgctc atcaatgctt gcctcaacaa gttgctgctc tacagggtgg tagacaacga 1320
ggggaccctg cagctgaaga gaagcttccc catcgagcag agctcacatc ctgtgcgcag 1380
catcttctgt cccctcatgt ccttccgcca gggggcctgc gtggtgacgg gcagtgagga 1440
catgtgcgtg cacttctttg atgtggagcg ggcggccaag gctgctgtca acaagctgca 1500
gggccacagt gcacctgtgc ttgatgtcag cttcaactgc gacgagagcc tactggcctc 1560
cagtgacgcc agcggcatgg tcatcgtctg gaggcgggag cagaagtagg gtcctgtcgg 1620
ccctgctgct gtcctccatc ccacccctct tactccagcc tcgtgttgta aataaagttt 1680
cggtggtcat gctgagggcc ggctcccagc tctgccgggg acggacaggg cagagggcag 1740
cgggcagctg ca 1752

<210> 82

<211> 1854

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2216640CB1 

<400> 82 

cccacgcgtc cgcgcaggat ggcggcagca gtggcggacg aggcggtggc gcgcgatgtg 60 
cagcggttgc tagtgcagtt ccaggatgag ggcgggcagc tgctgggttc cccgttcgac 120 
gtgcccgtgg acatcacccc ggacaggctg cagctcgtgt gcaacgcgct actggcccag 180 
gaggatcccc tgccactggc tttctttgtc cacgatgctg agatcgtctc ctcactgggg 240 
aagacgttgg agtcccaggc agtggagaca gagaaggtcc tagacatcat ctaccagcca 300 
caggctatct tcagagtccg ggctgtgact cgctgcacca gctccttgga gggtcacagt 360 
gaggcagtca tttctgtggc cttcagccct acgggaaagt acctggccag tggctctgga 420 
gacaccaccg tgcgcttctg ggatctcagc acagagacac cacatttcac atgcaaggga 480 
cacagacact gggtccttag tatatcctgg tctccagatg gcaagaagct ggcctcaggc 540 
tgcaagaatg gccagattct cctctgggac ccaagcacag ggaagcaggt gggcaggacc 600 
ctcgctggcc acagcaagtg gatcacaggc ctgagctggg agcccctcca tgcgaaccct 660 
gagtgccgct atgtggccag cagctccaag gatggcagtg tgcggatctg ggacacaact 720 
gcaggccgct gtgagcgcat cctcaccggg cacacccagt cggtcacctg tctccggtgg 780 
ggaggggacg ggcttctcta ctctgcctcc caggaccgca ccatcaaagt ctggagagct 840 
catgacggtg tgctgtgccg gactctgcaa ggccacggcc actgggtgaa caccatggcc 900 
ctcagcactg actatgccct gcgcactggg gcctttgaac ctgctgaggc ctcagttaat 960 
ccccaagacc tccaaggatc cttgcaggag ttgaaggaga gggctctgag ccgatacaac 1020 
ctcgtgcggg gccagggtcc agagaggctg gtgtctggct ccgacgactt caccttattc 1080 
ctgtggtccc cagcagagga caaaaagcct ctcactcgga tgacaggaca ccaagctctc 1140 
atcaaccagg tgctcttctc tcctgactcc cgcatcgtgg ctagtgcctc ctttgacaag 1200
tccatcaagc tgtgggatgg caggacgggc aagtacctgg cttccctacg cggccacgtg 1260 
gctgccgtgt accagattgc gtggtcagct gacagtcggc tcctggtcag cggcagcagt 1320 
gacagcacac tgaaggtgtg ggatgtgaag gcccagaagc tggccatgga cctgcccggc 1380
cacgcggatg aggtatatgc tgttgactgg agtccagatg gccagagagt ggcaagtggt 1440 
gggaaggaca aatgcctccg gatatggagg agatgagacg gcccgaagtt ctctctgacc 1500 
cccacctcga ctcggcctct gccagctgcc ttccctgcca gagaacaaag gctgagatgg 1560 
cagtgcacac accctcccca ccagtgggga cctgagaatg cgtgtggcct gctgtcctcg 1620 
atagaccgga atggggtttt cccacagatc cccgcctgtg gcacacccca gagccagaaa 1680 
tcgaaggtca caggaagttg tcactgaact tggcccgtgt ctgctactct gtaccttgct 1740 
ggtacagaca ggggtggtgg gcagccaggc tctatgagtg ggcccctagt gtcagctctg 1800 
tacagggtca gatcccaggt tctatgacca aataagtaac ttaaaaaaaa aaaa 1854 

<210> 83 

<211> 862 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2417361CB1 

<400> 83 

ggcgtggctt caacagactt tcttttgcct gtctttgtcc cagagcctct tccctggccc 60 
tgctgagacc actgctctaa gaagagacca ccagactgag agaggactcc cagctgccct 120 
cagagcggag gccgagtgct gcacagccac agctgctctg aagcccttcc atgaatcccc 180 
ggaagaaggt ggacctgaaa ctcattatcg tcggagccat tggtgtggga aagacctccc 240 
tccttcacca atatgtgcac aagacgtttt atgaggaata ccagaccaca ctgggggcca 300 
gcatcctctc caagattatc atattgggtg acacaacttt gaagttacag atctgggaca 360 
cgggcggtca ggagcggttc cgctccatgg tgtccacgtt ctacaagggc tccgatggct 420 
gcatcctagc ttttgatgtc accgacctgg agtcttttga agccctggat atctggcggg 480 
gtgatgtcct ggccaagatt gtccccatgg agcagtccta ccccatggtg ttgttgggga 540 
acaagatcga tctggcagac cggaaggtac cccaggaagt agctcaaggc tggtgtagag 600 
agaaagatat tccttacttt gaagtcagtg ccaagaatga catcaatgtg gtgcaagcgt 660 
ttgagatgct ggccagtagg gctctgtcga ggtaccagag catcttagaa aatcacctca 720
cagaatccat caagctctcg ccagaccagt caaggagcag atgctgctga cctccagacg 780
cctgctctgg aagcccagaa acagagcctg ccccgagcct ggtcacccca ggcttgagaa 840 
caggtgacca tccccctcca gc 862 

<210> 84 

<211> 1406 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2454384CB1 

<400> 84 

ctagagcctg gggtctcggc aacttccggc ggcgggagct gcagagcgca aggcccgccc 60 
actgcgcgtg cgcttcggcc cggctcctcc tgcgcccccg gcccctgcga ctgggacttg 120 
gtacggccgg gcggttggcg tcctctgcgg ctcctgccag gggcgggctt ttcaaatctt 180 
ccctttgaag gagtggcgac ggcccggaca gttcgcgttg gagatggagg ggccgagcct 240 
gaggggtcct gcgctccgcc tggcggggct tcccacccag caggactgca acattcaaga 300 
aaaaatagac ttagaaattc gaatgcgaga aggaatatgg aaactccttt ctctgagcac 360 
tcagaaagat caagttttac atgcagttaa gaatctcatg gtgtgcaatg ctcgactaat 420 
ggcctataca tcggagctac agaaattaga agaacagatt gcaaatcaga ctggaagatg 480 
tgatgtgaaa tttgaaagta aagaacgaac agcatgtaaa ggaaagattg ccatatcaga 540 
tattcgaata ccactaatgt ggaaagactc tgatcacttc agcaataaag aacgatcacg 600 
acgctatgcc attttttgtt tattcaaaat gggagctaat gtgtttgata ctgatgtggt 660 
gaatgtggat aaaacaatca cagatatatg ttttgaaaat gtaaccatat tgtaagtatt 720 
ttttaatctt cagagaataa aaataattta aaattcttct tttttaaaag aaagttctta 780 
ttattggttc tttggattca ttttatgttt aaatgtttaa gtgatcttta aatgtttaat 840 
atgattttaa aaattatttt gttcagaaga agtccatttc tctatctgca gttttctgat 900 
gtgaaataaa aatggaaatc ttgtaattac tattagcagt aaatatttga cttattagat 960 
atgacccatt tttaaattgt taataaatat agttcagtta ttaacaaagc tatgcataca 1020 
acagaatatc ctgtaatgtt atttgatata gagagaattt aagcataaaa caggattttt 1080 
atctcatgta ggatatttgg ttgcagaaat actaaaatag tatagcgact ttatttacaa 1140 
gatagtcctg aagtacatgc tatataggaa gagcactttg aaattttggg gtgttctttt 1200 
tcttatggtg cacttctttc atgtacttca aagcaataaa aaaaaatggg tgatctcagg 1260 
gctgttttta ttgtccctgc tcttttacag gctcatttta ttgtggtcat aatacagaac 1320 
aagaaggaac tccttgggta gccatagaaa tcatttttaa cttacatagt ttttcctgcc 1380 
ctccttcaaa ggttctatgt gcctaa 1406 

<210> 85 

<211> 1184 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2610262CB1 

<400> 85 

gcggttttgg tgcctgaagc agggagcgcg gagtcgttcc cgagagaggc ggccaggcta 60 
tgctcgccgg tttccggcgt tccgctccgg ccagccagag tctctgtctc aacctgtgtc 120 
cgtgctccag cagtctcctc agcccggccc cgcggcgcgg ttggcggcgg cgccccaggc 180 
gcgccccctc ctccgatggc ggcggagatc cagcccaagc ctctgacccg caagccgatc 240 
ctgctgcagc ggatggaggg gtcccaggag gtggtgaata tggccgtgat cgtgcccaaa 300
gaggagggcg tcatcagcgt ctccgaggac aggacagttc gtgtttggtt aaagagagac 360 
agtggacagt attggccaag cgtataccat gcaatgcctt ctccatgttc atgcatgtct 420 
tttaacccgg aaacaagaag actgtccata ggtctagaca atggtacaat ctcagagttt 480 
atattgtcag aagattataa caagatgact cctgtgaaaa actatcaagc gcatcagagc 540 
agagtgacga tgatcctgtt tgtcctggag ctggagtggg tgctgagcac aggacaggac 600 
aagcaatttg cctggcactg ctctgagagt gggcagcgcc tgggaggtta tcggaccagt 660 
gctgtggcct caggcctgca atttgatgtt gaaacccggc atgtgtttat cggtgaccac 720 
tcaggccaag taacaatcct caaactggag caagaaaact gcaccctggt cacaacattc 780 
agaggacaca caggtggggt gaccgctctc tgttgggacc cagtccagcg ggtgttgttc 840
tcaggcagtt cagatcactc tgtcatcatg tgggacatcg gtgggagaaa aggaacagcc 900
atcgagctcc aaggacacaa cgacagagtc caggccctct cctatgcaca gcacacgcga 960
caattgatct cctgtggcgg tgatggtggg attgtcgtct ggaacatgga cgtggagagg 1020
caggagcctc tgtggagctg cttcgtggtt atgataagtg ctgtgtgatg ctcaccttgg 1080
gaggtctgcg acatatattg aagtcatctc taacctgaag tactgacaga ctttctggaa 1140
gaaaaggctt gtaggaggaa acttcagaat tctattaaat ggtg 1184

<210> 86 

<211> 2965 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2700075CB1 

<400> 86 

ggcaccactg tgaaggtctg ggacgcagcc aagcagcagc ccctgacaga gctggcagcc 60 
catggggacc tggtgcagag cgccgtctgg agccgagatg gagccctggt gggcacggcg 120 
tgcaaggaca agcagctgcg gatctttgac cccagaacaa agccgcgggc ctctcagagc 180 
acgcaggccc atgagaacag cagggatagc cggctggcat ggatgggcac ctgggagcac 240 
cttgtgtcta ctggattcaa ccagatgcgt gagcgcgaag tgaagctgtg ggacacgcgg 300 
ttcttctcca gcgccctggc ctccctcacc ttggacacct cgcttgggtg tctcgtgcct 360 
ctgctggacc ctgactctgg gctcctggtc ctggcaggaa agggcgagag gcagctgtac 420 
tgttacgagg tggtcccgca gcagccggcg ctgagcccag tgacccagtg tgtcctggag 480 
agcgtgctgc gtggggctgc ccttgtgccc cggcaggcgc tggccgtcat gagctgcgag 540 
gtactccgcg tcctacagct gagcgacaca gccatcgtgc ccatcggcta ccatgtgccc 600 
cgcaaggctg tggagttcca cgaggacctg ttcccggaca ctgccggctg tgtgcctgcc 660 
accgaccccc atagctggtg ggctggggac aaccagcagg tgcagaaggt cagcctcaac 720 
cccgcctgcc ggccccaccc gagcttcact tcctgtctgg tgccccctgc ggagcccctc 780 
cctgacacag cccagcctgc ggtgatggag acacccgtgg gtgatgcaga cgcaagcgag 840 
ggtttctctt cccctcccag ttcgctgacc tcgccctcca cgccctccag cctggggccc 900 
tcactctcca gcaccagtgg catcgggacc agccccagtt tgaggtcgct gcagagcctg 960 
ctgggcccca gttccaagtt ccgccatgct cagggcactg tcctgcaccg agacagccac 1020 
atcaccaacc tcaaggggct caacctcacc acacctggtg agagtgacgg cttctgtgcc 1080 
aacaagctgc gtgtggccgt gccgctgctc agcagcgggg gacaggtggc tgtgcttgag 1140 
ctacggaagc ctggccgcct gcccgacacg gcactgccca cgctgcagaa tggggcagct 1200 
gtgactgatc tggcctggga cccctttgac ccccatcgcc tcgctgtggc tggtgaggac 1260 
gccaggatcc gactgtggcg ggtacccgca gagggcctgg aagaggtgct caccacgcca 1320 
gagactgtgc tcacaggcca cacggagaag atctgctccc tgcgcttcca cccactggca 1380 
gccaatgtgc tggcctcgtc ctcctatgac ctcactgttc gcatctggga ccttcaggct 1440 
ggagctgatc ggctgaagct gcagggccac caagaccaga tcttcagcct ggcctggagt 1500 
cctgatgggc agcagctggc cactgtctgc aaggatgggc gtgtgcgggt ctacaggccc 1560 
cggagtggcc ctgagcccct gcaggaaggc ccagggccca agggaggacg cggagctcgc 1620 
attgtctggg tatgtgatgg tcgctgtctg ctggtgtctg gctttgacag ccaaagtgag 1680 
cgccagctgc tcctatatga agctgaggcc ctggccggcg gacccttggc agtgttgggc 1740 
ctggacgtgg ctccctcaac cctgctgccc agctacgacc cagacactgg cctggtgctc 1800 
ctgaccggca agggcgacac ccgtgtattc ctgtacgagc tgctccccga gtcccctttc 1860 
ttcctggagt gcaacagctt cacgtcgcct gacccccaca agggcctcgt cctcctgcct 1920 
aagacggagt gcgacgtgcg ggaagtggag ctgatgcggt gcctgcggct gcgtcagtcc 1980 
tccctggagc ctgtggcctt ccggctgccc cgagtccgga aagagttctt ccaggatgac 2040 
gtgttcccag acacggctgt gatctgggag cctgtgctca gtgccgaggc ctggctgcaa 2100 
ggcgctaatg ggcagccctg gcttctcagc ctgcagcctc ctgacatgag cccagtgagc 2160 
caagcccccc gagaggcccc tgctcgtcgg gccccatcct cagcgcagta cctggaagaa 2220 
aagtctgacc agcaaaagaa ggaggagctg ctgaatgcca tggtggcaaa actggggaac 2280
cgggaggacc cactccccca ggactccttt gaaggcgtgg acgaggacga gtgggactag 2340 
cctgcgcccc cgtcacctcc acctcacctg tgctgccact tcctagtgca cacctcacgg 2400 
ctcatcctca agctggaaga tacctctctg gccccggcac atgtcacccc tgcactcctg 2460 
ccttcccgtg ggcacttcca catcctctgg gcctctggca gttcccaggg actgttttca 2520 
cctctgctgt ctctggggtc agctgctgct catcagctgc ccgctagcat gtggccaggg 2580 
gtgcagggtg gcggggggtc agcagcatgt ccctgggcag gccctgggca ccctgtctcc 2640 
cctggtctca ctgctgacct gggctggtcc cagcctggat tggcctcatc caggatcttt 2700 
ggtcacccca cgctgcccca tcttgcctgc tgttccagtt ctggtcaagg gccttggggg 2760
ctggcccccc accaggcctt ctagagcagc accagtctca gggccctggg accagctgcc 2820
ctacttccca ggtttgtagc caggagaagg gggcatcaca gagctgatgg tccaataagg 2880
ggggtgtgag ccccgcaggg actggcccgc acctgccttg gatgttttca gcaattaaac 2940
ttttttaagc tggcaaaaaa aaaaa 2965



<210> 87 

<211> 2823 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2786701CB1 



<400> 87 

cggaggcagc ctagcctcgc gccccgcccg ttgcttctgc cctccggcct tcccgccgcc 60
gtcgccggga ccagccgctc ggggccgggc tgatacagcc gcttcaccgt gcccctgccc 120
gcgaccatgg cctcctccga ggtggcgcgg cacctgctct ttcagtctca catggcaacg 180
aaaacaactt gtatgtcttc acaaggatca gatgatgaac agataaaaag agaaaacatt 240
cgttcgttga ctatgtctgg ccatgttggt tttgagagtt tgcctgatca gctggtgaac 300
agatccattc agcaaggttt ctgctttaat attctctgtg tgggggaaac tggaattgga 360
aaatcaacac tgattgacac attgtttaat actaattttg aagactatga atcctcacat 420
ttttgcccaa atgttaaact taaagctcag acatatgaac tccaggaaag taatgttcaa 480
ttgaaattga ccattgtgaa tacagtggga tttggtgacc aaataaataa agaagagagc 540
taccaaccaa tagttgacta catagatgct cagtttgagg cctatctcca agaagaactg 600
aagattaagc gttctctctt tacctaccat gattctcgca tccatgtgtg tctctacttc 660
atttcaccga caggccactc tctgaagaca cttgatctct taaccatgaa gaaccttgac 720
agcaaggtaa acattatacc agtgattgcc aaagcagata cggtttctaa aactgaatta 780
cagaagttta agatcaagct catgagtgaa ttggtcagca atggcgtcca gatataccag 840
ttcccaacgg atgatgacac tattgctaag gtcaacgctg caatgaatgg acagttgccg 900
tttgctgttg tgggaagtat ggatgaggta aaagtcggaa acaagatggt caaagctcgc 960
cagtaccctt ggggtgttgt acaagtggaa aatgaaaacc actgtgactt tgtaaagctg 1020
cgggaaatgc tcatttgtac aaatatggag gacctgcgag agcagaccca taccaggcac 1080
tatgagcttt acaggcgctg caaactggag gaaatgggct ttacagatgt gggcccagaa 1140
aacaagccag tcagtgttca agagacctat gaagccaaaa gacatgagtt ccatggtgaa 1200
cgtcagagga aggaagaaga aatgaaacag atgtttgtgc agcgagtaaa ggagaaagaa 1260
gccatattga aagaagctga gagagagcta caggccaaat ttgagcacct taagagactt 1320
caccaagaag agagaatgaa gcttgaagaa aagagaagac ttttggaaga agaaataatt 1380
gctttctcta aaaagaaagc tacctccgag atatttcaca gccagtcctt tctggcaaca 1440
ggcagcaacc tgaggaagga caaggaccgt aagaactcca attttttgta aaacagaagt 1500
tccagagcac agaaggtcat catcacaagc aaactttatt aaaaaaaaac tagaagtgtg 1560
ctttgatttt gctgttattt gttttatcac ttctatattt ggtgaacagc cacagttact 1620
gatatttatg gaaaagtact ttcaagtaca aggtcaatac ataagccaga gtgaatgata 1680
ctacaagttg agcatctcta attcaaaaat ctgaaatcca gaagcttcaa aatctgaatc 1740
tttttgagca ctgacttgac cccacaagtg gaaaattccc cacccgacac ctttgctttc 1800
tgatggttca gtttaaacag attttgtttc ttgcacaaaa tttttgtata aattactttc 1860
aggctatatg tataaggtgg atgtgaaaca tgaattatgt aattagagtc gggtcccgtt 1920
gtgtatatgc agatattcca aacctgaaat ccaaaacact tctggtccct agcattttgg 1980
ataagggata ctcagcttgt acctatatat tcatatatat tcactgttgt tagaaatgtt 2040
taagttgctg ttctgtgatg aatctaaatc ttttctcttg ctaccaagct attgtcactg 2100
cagtgcatta taccaaagag cgaagtcagt gccactgaaa atacagaacc cattaatatc 2160
gtggctatct gattacattt atattccaag atgaaccttt tttatatatg ctaaaaattt 2220
tggggaatat gttttgggat gtattatgga gctaaaactc taacctctta atagttttat 2280
agaacttaaa aattttttat acaattaccc aattggtgat atgatcttaa gcttttgtgt 2340
cagattattt aatatgatga cttcatgctt tattatgcct tattatggct gacgtattac 2400
tgtggtgaaa caaaatatct ttaaaagtta aaacatccag atatataagc tattttttcc 2460
taaggataaa gtacctttga gcatgagtgt atcacagctt tcattaggaa aacttttcat 2520
tacatacttg tttaaactct gtcttccagg gtaaaaataa taaggttgaa tcattttatt 2580
aaaaatactt tttaagaaaa taactatgaa catctgaata ttaaagatat aaaaatgcac 2640
ataattcata tttcaggtgg tatttgcatt cagtgcctta ctggtattct cagaacattt 2700
taatgatttc taacatttct taacagtcat agatatatac attttcattt tttgtacttg 2760
aatattctaa ataaaactga catttactct tgacaaataa aacatatatt tactaaaaaa 2820
aaa 2823

<210> 88

<211> 1549 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3068538CB1 



<400> 88 

gcagacccgg cacgcaggtg ggggccggcg gggtccgtgg ccagagctgc agagagacaa 60
ggcggcggcg gctgctgtgc tgggtgcagt gaggaagagg ccctcggtgg tgcccatggc 120
tggccaggat cctgcgctga gcacgagtca cccgttctac gacgtggcca gacatggcat 180
tctgcaggtg gcaggggatg accgctttgg aagacgtgtt gtcacgttca gctgctgccg 240
gatgccaccc tcccacgagc tggaccacca gcggctgctg gagtatttga agtacacact 300
ggaccaatac gttgagaacg attataccat cgtctatttc cactacgggc tgaacagccg 360
gaacaagcct tccctgggct ggctccagag cgcatacaag gagttcgata ggaagtacaa 420
gaagaacttg aaggccctct acgtggtgca ccccaccagc ttcatcaagg tcctgtggaa 480
catcttgaag cccctcatca gtcacaagtt tgggaagaaa gtcatctatt tcaactacct 540
gagtgagctc cacgaacacc ttaaatacga ccagctggtc atccctcccg aagttttgcg 600
gtacgatgag aagctccaga gcctgcacga gggccggacg ccgcctccca ccaagacacc 660
tccgccgcgg cccccgctgc ccacacagca gtttggcgtc agtctgcaat acctcaaaga 720
caaaaatcaa ggcgaactca tcccccctgt gctgaggttc acagtgacgt acctgagaga 780
gaaaggcctg cgcaccgagg gcctgttccg gagatccgcc agcgtgcaga ccgtccgcga 840
gatccagagg ctctacaacc aagggaagcc cgtgaacttt gacgactacg gggacattca 900
catccctgcc gtgatcctga agaccttcct gcgagagctg ccccagccgc ttctgacctt 960
ccaggcctac gagcagattc tcgggatcac ctgtgtggag agcagcctgc gtgtcactgg 1020
ctgccgccag atcttacgga gcctcccaga gcacaactac gtcgtcctcc gctacctcat 1080
gggcttcctg catgcggtgt cccgggagag catcttcaac aaaatgaaca gctctaacct 1140
ggcctgtgtc ttcgggctga atttgatctg gccatcccag ggggtctcct ccctgagtgc 1200
ccttgtgccc ctgaacatgt tcactgaact gctgatcgag tactatgaaa agatcttcag 1260
caccccggag gcacctgggg agcacggcct ggcaccatgg gaacagggga gcagggcagc 1320
ccctttgcag gaggctgtgc cacggacaca agccacgggc ctcaccaagc ctaccctacc 1380
tccgagtccc ctgatggcag ccagaagacg tctctagtgt tgcgaacact ctgtatattt 1440
cgagctacct cccacacctg tctgtgcact tgtatgtttt ataaacttgg catctgtaaa 1500
aataaccagc cattagatga attcagaacc ttctaatgaa aaaaaaaaa 1549



<210> 89 

<211> 1722 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5159072CB1 



<400> 89 

gcaagaggga gccggcccga cgcggaccgc ttccctgcag tgccccgagt cccgggcccg 60
cgccgccgcc gcccggctcc gctcgcggcc cctctgtctg caggcgtgcc ccggcggcgg 120
cggagagccg tcctcggccg aggaggctgg gaaacgcgag cgcaggcggc agagaggcct 180
caacgccgtc cctttcgcca ccgccttttc cttgcctcgc gccgctgtgc atttctctcc 240
ttttcctttg tttttttggc ccctcgcggg tgtgggcatt gttggttagc aaaagtgcag 300
cctcaagatg gctgatggca acgaggatct gcgggctgac gacttgcctg ggccagcctt 360
cgagagctat gagtccatgg agcttgcctg ccccgctgag cgcagcggcc acgtagccgt 420
cagcgacggg cgccacatgt tcgtctgggg cggctacaag agtaatcaag tcagaggatt 480
atatgacttt tatctgccta gagaagaact atggatctac aacatggaga ctggaagatg 540
gaaaaaaatc aacactgaag gtgatgttcc tccttctatg tcaggaagct gtgctgtgtg 600
tgtagacagg gtgctgtact tgtttggagg acaccattca agaggcaata ccaataagtt 660
ctacatgctg gattcaaggt ctacagacag agtgttacag tgggaaagaa ttgattgcca 720
aggaattcct ccatcatcaa aggacaaact tggtgtctgg gtatataaaa acaagttaat 780
attttttgga gggtatggat atttgcctga agataaagta ttgggaactt ttgaattcga 840
tgaaacatct ttttggaatt caagtcatcc aagaggatgg aatgatcatg tacatatttt 900
agatactgaa acatttacct ggagccagcc tataactact ggtaaagcac cttcacctcg 960
tgctgcccat gcctgtgcaa ctgtcggaaa tagaggcttc gtgtttggag gcagatatcg 1020
agatgctaga atgaatgatc ttcactatct taatctggat acatgggagt ggaatgaatt 1080
aattccacaa ggcatatgcc cagttggtcg atcttggcac tcactaacac cagtttcttc 1140
agatcatctt tttctctttg gaggatttac cactgataaa cagccactaa gtgatgcctg 1200
gacttactgc atcagtaaaa atgaatggat acaatttaat catccatata ccgaaaaacc 1260
aaggttatgg cacacagctt gtgccagcga tgaaggagaa gtaattgttt ttggtggatg 1320
tgccaacaac ttgcttgtcc atcacagagc tgcacacagt aatgaaatac taatattttc 1380
agttcaacca aaatctcttg tacggctaag cttagaagca gtcatttgct ttaaagaaat 1440
gttagccaac tcatggaact gccttccaaa acacttactt cacagtgtta atcagaggtt 1500
tggtagtaac aacacttctg gatcttaagg cttcataaat aatgcctatg atcaccttgc 1560
atggacagca atcctgtaaa catcacagag tggcatcatt tgtataatta tatgcattgt 1620
tgtagtttgc acctgttggt tttaatgtgc atgtgaatgg cctagagaac ctatttttgt 1680
gtctaaagtt tacaataaat gtatttaaca ccaaaaaaaa aa 1722



<210> 90 

<211> 1264 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5519057CB1 



<400> 90 

agcgcgcgct cttgcggtgg cgtaatctct cagcctttct gtgtctcctt tcctccgcct 60
cagtttggtg cgggtcgggg gaatggctga ggagatggag tcgtcgctcg aggcaagctt 120
ttcgtccagc ggggcagtgt caggggcctc agggtttttg cctcctgccc gctcccgcat 180
cttcaagata atcgtgatcg gcgactccaa tgtgggcaag acatgcctga cctaccgctt 240
ctgcgctggc cgcttccccg accgcaccga ggccacgata ggggtggatt tccgagaacg 300
agcggtggag attgatgggg agcgcatcaa gatccagcta tgggacacag caggacaaga 360
acgattcaga aagagcatgg ttcagcacta ctacagaaat gtacatgctg ttgtcttcgt 420
gtatgatatg accaacatgg ctagttttca tagcctacca tcttggatag aagaatgcaa 480
acaacatttg ctagccaatg atataccacg gattcttgtt ggaaataaat gtgacttgag 540
aagtgccata caggtaccca cagacttggc acaaaaattt gctgacacac acagtatgcc 600
tttgtttgaa acgtctgcta aaaaccccaa tgataatgac catgtggaag ctatatttat 660
gaccttggct cataagctta agtgccacaa accattaatg cttagtcagc cccctgataa 720
tggaattatc ctgaagcctg aaccaaagcc tgcaatgacg tgctggtgct aaataacagt 780
ctttattata ttatctaatt ttgactaaag aaatactttt gaagtatgac agtattaagt 840
cataagattt aatctcaact ataatgggtc atcttgacac tttgctgttt gtcattgtca 900
cgcttttgta ttttgtatct acttaagttt gtcactgtga caacacagga aaagttggtt 960
ttcaggtgag attgaaaatg aagcaaagat aggatgaatc tgaacatctc tccatctaga 1020
gcccaatgaa ggaagcttca aatgagaaca tgatggaatc agtaaccatt caatcttttg 1080
tcctaggatt ggaaaaaaat gttaaaggtt taggacacac ctaatagtat gtcctttgaa 1140
tgggaagttt tcttaatagg ataaaaactg gtatttcctt ctccccagag tacttttttg 1200
ttttttccat agagacgggg tcttgctatg ttgtccaggc tggccttgag ctcctgggct 1260
caag 1264



<210> 91 

<211> 2640 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 035379CB1 



<400> 91 

cggccgaggg ggcatcatga agcgggctgg cggcgctgcg tcccgggcgc gcgcgggcgg 60
gaggtgcttc ccaaggaccg tagatgcctc tctagagcat gagctcaggc aagagtgccc 120
gctacaaccg cttctccggg gggcccagca atcttcccac cccagacgtc accacaggga 180
ccagaatgga aacgaccttc ggacccgcct tttcagccgt caccaccatc acaaaagctg 240
acgggaccag cacctacaag cagcactgca ggacaccctc ctcctccagc acccttgcct 300
actccccgcg ggacgaggag gacagcatgc cccccatcag cactccccgc cgctccgact 360
ccgccatctc tgtccgctcc ctgcactcag agtccagcat gtctctgcgc tccacattct 420
cactgcccga ggaggaggag gagccggagc cactggtgtt tgcggagcag ccctcggtga 480
agctgtgctg tcagctctgc tgcagcgtct tcaaagaccc cgtgatcacc acgtgtgggc 540
acacgttctg taggagatgc gccttgaagt cagagaagtg tcccgtggac aacgtcaaac 600
tgaccgtggt ggtgaacaac atcgcggtgg ccgagcagat cggggagctc ttcatccact 660
gccggcacgg ctgccgggta gcgggcagcg ggaagccccc catctttgag gtggaccccc 720
gagggtgccc cttcaccatc aagctcagcg cccggaagga ccacgagggc agctgtgact 780
acaggcctgt gcggtgtccc aacaacccca gctgcccccc gctgctcagg atgaacctgg 840
aggcccacct caaggagtgc gagcacatca aatgccccca ctccaagtac gggtgcacgt 900
tcatcgggaa ccaggacact tacgagaccc acctggagac ttgccgcttc gagggcctga 960
aggagtttct gcagcagacg gatgaccgct tccacgagat gcacgtggct ctggcccaga 1020
aggaccagga gatcgccttc ctgcgctcca tgctgggaaa gctctcggag aagatcgacc 1080
agctagagaa gagcctggag ctcaagtttg acgtcctgga cgaaaaccag agcaagctca 1140
gcgaggacct catggagttc cggcgggacg catccatgtt aaatgacgag ctgtcccaca 1200
tcaacgcgcg gctgaacatg ggcatcctag gctcctacga ccctcagcag atcttcaagt 1260
gcaaagggac ctttgtgggc caccagggcc ctgtgtggtg tctctgcgtc tactccatgg 1320
gtgacctgct cttcagtggc tcctctgaca agaccatcaa ggtgtgggac acatgtacca 1380
cctacaagtg tcagaagaca ctggagggcc atgatggcat cgtgctggct ctctgcatcc 1440
aggggtgcaa actctacagc ggctctgcag actgcaccat cattgtgtgg gacatccaga 1500
acctgcagaa ggtgaacacc atccgggccc atgacaaccc ggtgtgcacg ctggtctcct 1560
cacacaacgt gctcttcagc ggctccctga aggccatcaa ggtctgggac atcgtgggca 1620
ctgagctgaa gttgaagaag gagctcacag gcctcaacca ctgggtgcgg gccctggtgg 1680
ctgcccagag ctacctgtac agcggctcct accagacaat caagatctgg gacatccgaa 1740
cccttgactg catccacgtc ctgcagacgt ctggtggcag cgtctactcc attgctgtga 1800
caaatcacca cattgtctgt ggcacctacg agaacctcat ccacgtgtgg gacattgagt 1860
ccaaggagca ggtgcggacc ctcacgggcc acgtgggcac cgtgtatgcc ctggcggtca 1920
tctcgacgcc agaccagacc aaagtcttca gtgcatccta cgaccggtcc ctcagggtct 1980
ggagtatgga caacatgatc tgcacgcaga ccctgctgcg tcaccagagc agtgtcaccg 2040
cgctggctgt gtcccggggc cgactcttct caggggctgt ggatagcact gtgaaggttt 2100
ggacttgcta acaggatcca ggccaggctg tggtttcccc tgaaccagcc ctggaccttt 2160
ctgagccagg ctggccacat ggggtggtct cggggtttct gcctgccccg tgggcatagg 2220
tggacaggct ctggcagccg ggcagtgccc tccccgtccc atgctcggcg agcctccctc 2280
tactcggcac tgtccttgct gcccagcccc tctctgggtg ccaggtacga cgcttgcccc 2340
ggcccaccct ccatccccac cctccatccc caccctagat ggagcgaggg cctttttact 2400
caccttttct accgttttta gactgtatgt agattggtta cctcctggtt gaaataaatg 2460
ctccacagac tgtggctgtg agtggggaca gctcctcggg acaagggggc tgtgtgtggc 2520
cttgaggttg gtgtgcacag gcactggctg ctgtgagtgg gggggcatgg ggcagtttcc 2580
tttggtggac cccaggactt cggcccactc cggggcctcc cctccctgct aggaggtaac 2640



<210> 92 

<211> 2071 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 275354CB1 



<400> 92 

gtgggcggaa ctcctagcgg acacctcgtg gagtccggcc ggaagagcaa ccgagatgaa 60
ggtgaagatg ctgagccgga atccggacaa ttatgtccgc gaaaccaagt tggacttaca 120
gagagttcca agaaactatg atcctgcttt acatcctttt gaggtcccac gagaatatgt 180
aagagcttta aatgctacca aactggaacg agtatttgca aaaccattcc ttgcttcgct 240
ggatggtcac cgtgatggag tcaattgctt ggcaaagcat ccagagaagc tggctactgt 300
cctttctggg gcgtgtgatg gagaggttag aatttggaat ctaactcagc ggaattgtat 360
ccgtacaata caagcacatg aaggctttgt acgaggaata tgtactcgct tttgtgggac 420
ttcttttttc actgttggtg atgacaaaac tgtgaagcag tggaaaatgg atgggccagg 480
ctatggagac gaggaagagc cattacatac aatattagga aagacagtgt atactgggat 540
tgatcatcac tggaaagaag ctgtttttgc cacatgtgga cagcaagtag acatttggga 600
tgaacaaaga actaatccta tatgttcaat gacctgggga tttgacagta taagtagtgt 660
taaatttaac ccaattgaga catttctctt gggaagttgt gcatctgaca ggaatatagt 720
actgtacgat atgaggcaag ctactccttt gaaaaaggtt atcttagata tgagaacaaa 780
tacaatctgt tggaacccta tggaagcttt catttttaca gcagcaaatg aagattataa 840
cttatatact tttgatatgc gtgcactgga cactcctgta atggtccata tggatcatgt 900
atctgcagtg cttgatgtgg attactctcc cactgggaag gagtttgtgt ctgctagttt 960
cgataaatct attcgaatct ttcctgtaga caaaagtcga agcagggagg tatatcatac 1020
aaagagaatg caacatgtta tctgtgtaaa atggacttct gacagcaagt atattatgtg 1080
tggatctgat gaaatgaaca ttcgcctgtg gaaagctaat gcttctgaaa aattgggtgt 1140
gcttacatca cgagaaaaag cagccaagga ttataaccag aaattgaagg agaaatttca 1200
gcattatcct catataaaac gtatagctcg tcatcgacat ctaccaaaat ctatctatag 1260
ccagattcag gaacagcgca tcatgaaaga agctcgtcga cgaaaggaag tgaatcgtat 1320
taaacacagc aagcctggat ctgtgccact tgtgtcagag aagaagaaac acgtagtggc 1380
agttgtaaaa taattggtat tcctaacaat cctgatgtat aattatttgt tacttttgat 1440
ttgagaactc tacaaataaa agtgctggga ctagattaat tgcaaacatt ttagttatat 1500
gtgtagagct ttattgttac tccttttagc taccctgaaa aatgatcctt aaaggtggcc 1560
tagttggtaa gactgtttta tccttaatct gcattcttct ttcattgtag aatacagtat 1620
ttgcaactca ttttttcttg tttttattac agatatactt actttctctt tgatctatta 1680
ttgtagacac tatacattca aattgacatt taagaccaaa catctcttat gttatcttta 1740
atattacttt gaataatgat tgcaatgatg tttcttcctg tgattccaca taacatttag 1800
aataatgatg tcaatttttt acaactgaat ttatttctag tgctttactt atatttggct 1860
ttttgactct tttaaaacaa tcagcctgca tttatataac ttttataaat aataatataa 1920
tttgggtcaa gttaagatat taaaagttcc tttcagcatt gaaactttgg cctatttttg 1980
gtaaataatt ttcaatctca ctaaatccta aatagctctg tgtaacatag gtttttcttt 2040
ttttaatcat aaacttaata aactttgtgg a 2071



<210> 93 

<211> 2149 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 311658CB1 

<400> 93 

cattattttt aaaaatatta cccactcttg atagtgtatc tgcactgaga cacgtactgg 60
aagctatata ttgtttgaca tcccaatcta aaacaactca tctttcttac ttatgactag 120
agttcctcct cttcatttat attcttttct tggtgaacat cagtgtctac caatttctaa 180
atgcaaagga gaaagataca attttaagcg aaatggtggt gatatgcaca acttgcagaa 240
ggttacataa aacttgggtt ttcagagatg attttttctc ttctttttag gatatgttca 300
aggaatgagt gatttacttt cccctctttt atatgtgatg gaaaatgaag tggatgcctt 360
ttggtgcttt gcctcttaca tggaccaaat gcatcagaat tttgaagaac aaatgcaagg 420
catgaagacc cagctaattc agctgagtac cttacttcga ttgttagaca gtggattttg 480
cagttactta gaatctcagg actctggata cctttatttt tgcttcaggt ggcttttaat 540
cagattcaaa agggaattta gttttctaga tattcttcga ttatgggagg taatgtggac 600
cgaactacca tgtacaaatt tccatcttct tctctgttgt gctattctgg aatcagaaaa 660
gcagcaaata atggaaaagc attatggctt caatgaaata cttaagcata tcaatgaatt 720
gtccatgaaa attgatgtgg aagatatact ctgcaaggca gaagcaattt ctctacagat 780
ggtaaaatgc aaggaattgc cacaagcagt ctgtgagatc cttgggcttc aaggcagtga 840
agttacaaca ccagattcag acgttggtga agacgaaaat gttgtcatga ctccttgtcc 900
tacatctgca tttcaaagta atgccttgcc tacactctct gccagtggag ccagaaatga 960
cagcccaaca cagataccag tgtcctcaga tgtctgcaga ttaacacctg catgatcact 1020
gttcttgctt ttttgggaag agacactttg ttgcaaccct ttttcaagta cttgaaagtt 1080
gaaaatttga aatcttggta ttgatcatgc tttaaggttt atgtaaagaa agtgtactga 1140
tgttcttaca ttaaagcttt acaaagattt aaactaatta tttttgtagt tacttctacc 1200
aaatagcctt tccttttcga taacattcct cagtattttt atagccaagt acattttatt 1260
ttcttgctga tgaactggaa ttggataaat attgcaagtg gatgagttgg aaattatgca 1320
ctttgaaaaa cattcacttt gtttaagctt attgggtttc agatttgatt aaattaaatg 1380
tggaggcttt ctatagcatt ctaagctgag aagtagattg ttacccagta atgaaataaa 1440
aaataaaaat aaaaggattt ttttctctat tgtttacgac agtactcagc ttaaatattt 1500
atgctggtca aatgtgattt aaattggaca ttttcatcaa tgcagtctaa tgtgtagata 1560
aatatttcaa ccataataag tggattggca gtatattttt tacattgaac ttttcttcac 1620
ttgtatataa agattatata taagtactta tttatgagta taagaaaggt taggcatatt 1680
ttcattaact gaataaacga cttgatttat ataacctggt ttatcaaaat ttaacatggc 1740
ttcagtatga gatctttttc aaaactattt tcttaaacat ttatttcatg agattatgtt 1800
caaccctgta cctggtgtaa ttttaaaatt aattgcttgt aacctcactt tactaataat 1860
gtttattatc tttcctaata atgcattaac tgattaatca ggtgtttaaa tttttataaa 1920
atactcttgc aaaaagttta tttgaaaaat ttctagatgg tctcatgagt ttcaaaataa 1980 
taatttttgt gtatgaacaa agctgttgtt tttaccatgc agtattgcat gattttaagt 2040 
tatgtggaat taacataact gattttgttt taattgtaag ttgttaactc ctgtatatat 2100 
cattaaaata aatctgaagt tgaagtagtg tttttagtta aattatact 2149 

<210> 94 

<211> 2332 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1251632CB1 

<400> 94 

gccaccccag aactgggcag cagcctcaag aagaagaagc ggctctccca gtcagatgag 60 
gatgtcatta ggctaatagg acagcacttg aatggcttag ggctcaacca gactgttgat 120 
ctcctcatgc aagagtcagg atgtcgttta gaacatcctt ctgctaccaa attccgaaat 180 
catgtcatgg aaggagactg ggataaggca gaaaatgacc tgaatgaact aaagccttta 240 
gtgcattctc ctcatgctat tgtgaggatg aagtttttgc tgctgcagca gaagtaccta 300 
gaatacctgg aggatggcaa ggtcctggag gcacttcaag ttctacgctg tgaattgacg 360 
ccgctgaaat acaatacaga gcgcattcat gttcttagtg ggtatctgat gtgtagccat 420 
gcagaagacc tacgtgcaaa agcagaatgg gaaggcaaag ggacagcttc ccgatctaaa 480 
ctattggata aacttcagac ctatttacca ccatcagtga tgcttccccc acggcgttta 540 
cagactctcc tgcggcaggc ggtggaacta caaagggatc ggtgcctata tcacaatacc 600 
aaacttgata ataatctaga ttctgtgtct ctgcttatag accatgtttg tagtaggagg 660 
cagttcccat gttatacgca gcagatactt acggagcatt gtaatgaagt gtggttctgt 720 
aaattctcta atgatggcac taaactagca acaggatcaa aagatacaac agttatcata 780 
tggcaagttg atccggatac acacctgcta aaactgctta aaacattaga aggacatgct 840 
tatggcgttt cttatattgc atggagtcca gatgacaact atcttgttgc ttgtggccca 900 
gatgactgct ctgagctttg gctttggaat gtacaaacag gagaactaag gacaaaaatg 960 
agccagtctc atgaagacag tttgacaagt gtggcttgga atccagatgg gaagcgcttt 1020 
gtgactggag gtcagcgtgg gcagttctat cagtgtgact tagatggtaa tctccttgac 1080 
tcctgggaag gggtaagagt gcaatgcctt tggtgcttga gtgatggaaa gactgttctg 1140 
gcatcagata cacaccagcg aattcggggc tataacttcg aggaccttac agataggaac 1200 
atagtacaag aagatcatcc tattatgtct tttactattt caaaaaatgg ccgattagct 1260 
ttgttaaatg tagcaactca gggagttcat ttatgggact tgcaagacag agttttagta 1320 
agaaagtatc aaggtgttac acaagggttt tatacaattc attcatgttt tggaggccat 1380 
aatgaagact tcatcgctag tggcagtgaa gatcacaagg tttacatctg gcacaaacgt 1440 
agtgaactgc caattgcgga gctgacaggg cacacacgta cagtaaactg tgtgagctgg 1500 
aacccacaga ttccatccat gatggccagc gcctcagatg atggcactgt tagaatatgg 1560 
ggaccagcac cttttataga ccaccagaat attgaagagg aatgcagtag catggatagt 1620 
tgatggtgaa tttggagcag acgacttctg tttaacttaa aattagtcgt attttaatgg 1680 
cttgggattt ggtgcaaaca aacatgattg atagctggac agacatgctc gtcatgaaaa 1740 
aagaaccatt tctgaagccc gattggggcc aaacatttac accttgcttc atagtaacca 1800 
gttgagatga agcacgtcgt tagaacgttg ttggacacca tgttgaatta ttcccccatc 1860 
ggttgtgaag aactgtgcta cattcaggct tacccattga actcagtata tatatttttt 1920 
ttccttcctg tcttttgtct ggcaggatac cattcttgtt gctcttctgt gtaatgaagt 1980 
ttaaatgctt gtttggaaaa ctttatttaa cagtttagaa ggcttgatag aaagagtgca 2040 
ttagtctgaa gagtatacat tggataggaa agaatttcct tcttttgttt ctccaaatct 2100 
ttccgcctta tttagcttga gatctttgca gcttggttca tggattctag ccttgcccgt 2160 
tgcgcagtat atactgatcc agatgataaa ccagtgaact atgtcaaaag cactctcaat 2220 
attacatttg acaaaaagtt ttgtactttt cacatagctt gttgccccgt aaaagggtta 2280
acagcacaat tttttaaaaa taaattaaga agtatttata ggaaaaaaaa aa 2332 

<210> 95 

<211> 1751 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1331955CB1



<400> 95 

gcccgaatcg actccggaga caacgccggg cgacgccacc tgcgcaggtc ccggaggccg 60
ctggtgtctg tgtacagggc gtgctgtctg tggaaacgcg agggcacact agcactttcc 120
tggaaggacc ccagacccac caagccactc agtccctgga cgagtttcca ctcaccccga 180
ctgcctctgt caccgggtcc ctccaccctt gtctcctgtg cggccagcgt cagagccatg 240
gcgacggagg agaagaagcc cgagaccgag gccgccagag cacagccaac cccttcgtca 300
tccgccactc agagcaagcc tacacctgtg aagccaaact atgctctaaa gttcaccctt 360
gctggccaca ccaaagcagt gtcctccgtg aaattcagcc cgaatggaga gtggctggca 420
agttcatctg ctgataaact tattaaaatt tggggcgcgt atgatgggaa atttgagaaa 480
accatatctg gtcacaagct gggaatatcc gatgtagcct ggtcgtcaga ttctaacctt 540
cttgtttctg cctcagatga caaaaccttg aagatatggg acgtgagctc gggcaagtgt 600
ctgaaaaccc tgaagggaca cagtaattat gtcttttgct gcaacttcaa tccccagtcc 660
aaccttattg tctcaggatc ctttgacgaa agcgtgagga tatgggatgt gaaaacaggg 720
aagtgcctca agactttgcc agctcactcg gatccagtct cggccgttca ttttaatcgt 780
gatggatcct tgatagtttc aagtagctat gatggtctct gtcgcatctg ggacaccgcc 840
tcaggccagt gcctgaagac gctcatcgat gacgacaacc cccccgtgtc ttttgtgaag 900
ttctccccga acggcaaata catcctggcc gccacgctgg acaacactct gaagctctgg 960
gactacagca aggggaagtg cctgaagacg tacactggcc acaagaatga gaaatactgc 1020
atatttgcca atttctctgt tactggtggg aagtggattg tgtctggctc agaggataac 1080
cttgtttaca tctggaacct tcagacgaaa gagattgtac agaaactaca aggccacaca 1140
gatgtcgtga tctcaacagc ttgtcaccca acagaaaaca tcatcgcctc tgctgcgcta 1200
gaaaatgaca aaacaattaa actgtggaag agtgactgct aagtcccttt gctcctgccc 1260
gcgagagact gtcgggaagt tgacccggat tggcaagaaa cagggtgtct tggaggtggt 1320
cccccagatc tgcgcctggg ggtcaggaca gggcctgatt tgagcctcct ctctgaagat 1380
gatttggccg agcggaaggt gtggaccacc ggaaagttct taaaagttgc tggtgacatt 1440
tcttgccaat tctaacactg tctagggaag agttcctagt ctattgtgtt caaacagagt 1500
caacaaaagt ttttaatttt ttattacaga agggtgaagt tcaatttaac atgcgttgtg 1560
ttttttcagt aaacgttctg tatctttttg atattccatg acccagtgca cgctgtggcc 1620
tgtcaccgcc accgtggccc cgccagctgg cctccccttt ggcccacgcc ggccgccccc 1680
attctctgct gcgtagatgc cctggcccag ggcctgactc tccattcccg ccagtagggg 1740
taccgagctc g 1751



<210> 96 

<211> 1285 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1412614CB1 



<400> 96 

cggagaaaaa gctgacctaa tgaaactgtg gcaacgtcag cgcttgaggc ttgaagaggg 60
agacaagcta aaagaggata tccagttgtt tcatgatggc atttgcacct ccaaaaaaca 120
cagatggtcc caaaatgcag acaaagatga gcacctggac acccctaaac catcagctat 180
tgaatgaccg ggtatttgaa gaaagaagag ccctgcttgg caaatggttt gacaaatgga 240
cagactctca aagaagaaga atcctcacag gcctgttgga gcgctgctcg ctgtcccagc 300
aaaagttctg ctgtcgaaag cttcaagaga aaattccagc agaagccctg gactttacaa 360
ccaagcttcc aagggtgtta tctttataca tcttttcttt cctggaccct cggagccttt 420
gtcgttgtgc acaggtgtgc tggcattgga agaaccttgc tgagctggac cagctctgga 480
tgctgaaatg tttacggttt aactggtaca tcaatttctc tccaactccc tttgagcagg 540
ggatctggaa gaagcactat attcaaatgg tgaaagaact tcatattacc aagcctaaga 600
cacccccaaa ggatggattt gtaatcgctg acgttcaact agttacaagc aattctccag 660
aggaaaaaca gtccccttta tcagcttttc ggtcctcttc ctctttaaga aagaagaata 720
actcagggga gaaagcactt ccaccctggc gatcttctga taagcaccca acagatatca 780
ttcgttttaa ttacctagac aaccgtgacc ccatggagac tgtccagcaa ggaagaagaa 840
aaagaaacca aataacccca gacttcagcc gacagtcaca tgataagaaa aataaattgc 900
aggacagaac taggctaaga aaagcacaat caatgatgtc gaggagaaat cccttcccac 960
tatgtcccta agtgccagct ctcccctaaa agttccagct catctcgcct ggcctccccc 1020
tgagtcagtg ggactcccag ccactgccac cacagctgaa attctcatgc agcatcctca 1080
caggcaccct gggccccaag catgactcat ccaggttcca gagccaaagt ggactgaaca 1140
tgggaagact tttattatag aaatgacaag atgctttgca cagtggagag ctgaatttac 1200
ttggctccca ttagaaactc tttcagctta agtacttatt gtggtagtga gtcctacggt 1260
atttcagtaa aaaggaattc atggc 1285 



<210> 97 

<211> 3260 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1750781CB1 



<400> 97 

ccggaagacc gtcccggatg gcctcgggga ctgccagtgt gtggaggtga gctccgggat 60
tgccggcatt cccgcttctg ctggttgctt catgctgcag gctgcggccg tcagccctcg 120
ctcgcattgg tggcgctgag gtgccggggc agcaagtgac atgtcgtcgg gcctccgcgc 180
cgctgacttc ccccgctgga agcgccacat ctcggagcaa ctgaggcgcc gggaccggct 240
gcagagacag gcgttcgagg agatcatcct gcagtataac aaattgctgg aaaagtcaga 300
tcttcattca gtgttggccc agaaactaca ggctgaaaag catgacgtac caaacaggca 360
cgagataagt cccggacatg atggcacatg gaatgacaat cagctacaag aaatggccca 420
actgaggatt aagcaccaag aggaactgac tgaattacac aagaaacgtg gggagttagc 480
tcaactggtg attgacctga ataaccaaat gcagcggaag gacagggaga tgcagatgaa 540
tgaagcaaaa attgcagaat gtttgcagac tatctctgac ctggagacgg agtgcctaga 600
cctgcgcact aagctttgtg accttgaaag agccaaccag accctgaagg atgaatatga 660
tgccctgcag atcactttta ctgccttgga gggaaaactg aggaaaacta cggaagagaa 720
ccaggagctg gtcaccagat ggatggctga gaaagcccag gaagccaatc ggcttaatgc 780
agagaatgaa aaagactcca ggaggcggca agcccggctg cagaaagagc ttgcagaagc 840
agcaaaggaa cctctaccag tcgaacagga tgatgacatt gaggtcattg tggatgaaac 900
ttctgatcac acagaagaga cctctcctgt gcgagccatc agcagagcag ccacgagacg 960
ctctgtctct tccttcccag tcccccagga caatgtggat actcatcctg gttctggtaa 1020
agaagtgagg gtaccagcta ctgccttgtg tgtcttcgat gcacatgatg gggaagtcaa 1080
cgctgtgcag ttcagtccag gttcccggtt actggccact ggaggcatgg accgcagggt 1140
taagctttgg gaagtatttg gagaaaaatg tgagttcaag ggttccctat ctggcagtaa 1200
tgcaggaatt acaagcattg aatttgatag tgctggatct tacctcttag cagcttcaaa 1260
tgattttgca agccgaatct ggactgtgga tgattatcga ttacggcaca cactcacggg 1320
acacagtggg aaagtgctgt ctgctaagtt cctgctggac aatgcgcgga ttgtctcagg 1380
aagtcacgac cggactctca aactctggga tctacgcagc aaagtctgca taaagacagt 1440
gtttgcagga tccagttgca atgatattgt ctgcacagag caatgtgtaa tgagtggaca 1500
ttttgacaag aaaattcgtt tctgggacat tcgatcagag agcatagttc gagagatgga 1560
gctgttggga aagattactg ccctggactt aaacccagaa aggactgagc tcctgagctg 1620
ctcccgtgat gacttgctaa aagttattga tctccgaaca aatgctatca agcagacatt 1680
cagtgcacct gggttcaagt gcggctctga ctggaccaga gttgtcttca gccctgatgg 1740
cagttacgtg gcggcaggct ctgctgaggg ctctctgtat atctggagtg tgctcacagg 1800
gaaagtggaa aaggttcttt caaagcagca cagctcatcc atcaatgcgg tggcgtggtc 1860
gccctctggc tcgcacgttg tcagtgtgga caaaggatgc aaagctgtgc tgtgggcaca 1920
gtactgacgg ggctctcagg gctgggagga ccccagtgcc ctcctcagaa gaagcacatg 1980
ggctcctgca gccctgtcct ggcaggtgat gtgctgggta tagcatggac ctcccagaga 2040
agctcaagct atgtggcact gtagctttgc cgtgaatggg atttctgaag atttgactga 2100
ggtctctctt ggcctggaag aataacactg aaaaaacctg acgctgcggt cacttagcag 2160
aggctcaggt tcttgccttg ggaaacacta ctagctctga ccttccatac ctcacttggg 2220
ggagcacagg gccccgctgg gcctcctcac caacggcagt gccaaaatca gcccccacat 2280
caaggtggtg ttctctgtgc tttctctcgt ccttccaaag tcggttctgg cctaacgcat 2340
gtcccaacac cttgggttca tttgcccggt gaactcactt taagcattgg attaacggaa 2400
actcccgaac tacagacccc tccctggtgg gttgcatgaa tgtgtctcat tactgctgaa 2460
atgtcctcac atctcttcca ctgttcttca gagctttctg gctctctttc cccacaaaat 2520
tcgacacatt taaaaatctc cgtgtggctt taaaaaatgg ttttttgttt ttttgttttt 2580
ttgaggtggg agaggatgtg tgaaaatctt ttccagggaa atgggttcgc tgcagaggta 2640
aggatgtgtt cctgtatcga tctgcagaca cccagaaggt gggtgcacac tgcatgcttg 2700
ggggtgccaa gggattcgag acctccaaca tacttgtctg aaggtggtga ttctggccat 2760
ggcccctctg ccaagcctgt gtgcgatgcc cttggtgctt tagtgcaaga agcctaggct 2820
cagaagcaca gcagcgccat ctttccgttt caggggttgt gatgaaggcc aaggaaaaac 2880
atttatcttt actattttac ctacgtataa agttttagtt cattgggtgt gcgaaacacc 2940
ctttttatca cttttaaatt tgcactttat tttttttctt ccatgcttgt tctctggaca 3000
tttggggatg tgagtgttag agctggtgag agaggagtca ggcggccttc ccaccgatgg 3060
tcctggcctc cacctgccct ctcttccctg cctgatcacc gctttccaat ttgcccttca 3120
gagaacttaa gtcaaggaga gttgaaattc acaggccagg gcacatcttt tatttatttc 3180
attatgttgg ccaacagaac ttgattgtaa ataataataa agaaatctgt tatatacttt 3240
tcaaaaaaaa aaaaaaaaaa 3260

<210> 98

<211> 1276

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1821658CB1

<400> 98

gcggcacccc caaggaagac cagcctgcct ctggtcggtt cctggcgctc tgcgtttcgt 60
gaccttgtcc agtagaaggc tatttaattt tcacaactgc ttgaattttg acatacaaga 120
tgaagcaaga tgcctcaaga aatgctgcct acactgtgga ttgtgaagat tatgtgcatg 180
tggtagaatt taatcccttt gagaatgggg attcaggaaa cctaattgca tatggtggca 240
ataattatgt ggtcattggc acgtgtacgt ttcaggaaga agaagcagac gttgaaggca 300
ttcagtataa aacacttcga acatttcacc atggagtcag ggttgatggc atagcttgga 360
gcccagagac tagacttgat tcattgcctc cagtaatcaa attttgtact tcagctgctg 420
atatgaaaat tagattattt acttcagatc ttcaggataa aaatgaatat aaggttttag 480
agggccatac cgatttcatt aatggtttgg tgtttgatcc caaagaaggc caagaaattg 540
caagtgtgag tgacgatcac acctgcagga tttggaactt ggaaggagtg caaacagctc 600
attttgttct tcattctcct ggcatgagtg tgtgctggca tcctgaggag acttttaagc 660
taatggttgc agagaagaat ggaacaatcc ggttttatga tcttttggcc caacaggcta 720
ttttatctct tgaatcagaa caagtgccat taatgtcagc acactggtgc ttaaaaaaca 780
ccttcaaagt tggagccgtt gcaggaaatg attggttaat ttgggatatt actcggtcca 840
gttatcctca aaataagaga cctgttcaca tggatcgagc ctgcttattc aggtggtcca 900
caattagtga aaatctgttt gcaaccactg gttatcctgg caaaatggca agccagtttc 960
aaattcatca tttaggacac cctcagccca tcctcatggg ttctgtagcc gttggatctg 1020
gactgtcctg gcatcgaact ctccctctgt gtgtaattgg aggagaccac aagctgttgt 1080
tttgggtgac tgaagtataa agtgttttct gtaccttaga ttcacaaact ttgtattttt 1140
agtacatatt ttgaagaatt tctatagtac atattttgaa gaatttttat atcaaatata 1200
ccgtatactt tagaaaatgt ctcagttgct tttattaaat aaaatgttga tggtttgaaa 1260
aattaaaaaa aaaaaa 1276



<210> 99 

<211> 3608 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1872574CB1 



<400> 99 

gttttggttc tagccgctcg ccgtccttgc aggctctgcc gtcggaaagc cgctcattct 60
cgcttcccct tccctttccc ggctcaagtc cttcctctct ctttcctttc tttccgccta 120
tcttttttct gctgccgctc cgggtccggg ccattttccg ggccgggcgc actaaggtgc 180
gcggccccgg ggcccagtat atgacccgcc gtcctgctat ccttcgcttc ccccgcccca 240
tgtggctgcg gggccgcggc ggcgctgccc actatggccc ggaaagtagt tagcaggaag 300
cggaaagcgc ccgcctcgcc gggagctggg agcgacgctc agggcccgca gtttggctgg 360
gatcactcgc ttcacaaaag gaaaagactt cctcctgtga agagatcctt agtatactac 420
ttgaagaacc gggaagtcag gctacagaat gaaaccagct actctcgagt gttgcatggt 480
tatgcagcac agcaacttcc cagtctcctg aaggagagag agtttcacct tgggaccctt 540
aataaagtgt ttgcatctca gtggttgaat cataggcaag tggtgtgtgg cacaaaatgc 600
aacacgctat ttgtcgtaga tgtccagaca agccagatca ccaagatccc cattctgaaa 660
gaccgggagc ctggaggtgt gacccagcag ggctgtggta tccatgccat cgagctgaat 720
ccttctagaa cactgctagc cactggagga gacaacccca acagtcttgc catctatcga 780






ctacctacgc tggatcctgt gtgtgtagga gatgatggac acaaggactg gatcttttcc 840 
atcgcatgga tcagcgacac tatggcagtg tctggctcac gtgatggttc tatgggactc 900 
tgggaggtga cagatgatgt tttgaccaaa agtgatgcga gacacaatgt gtcacgggtc 960 
cctgtgtatg cacacatcac tcacaaggcc ttaaaggaca tccccaaaga agacacaaac 1020 
cctgacaact gcaaggttcg ggctctggcc ttcaacaaca agaacaagga actgggagca 1080 
gtgtctctgg atggctactt tcatctctgg aaggctgaaa atacactatc taagctcctc 1140 
tccaccaaac tgccatattg ccgtgagaat gtgtgtctgg cttatggtag tgaatggtca 1200 
gtttatgcag tgggctccca agctcatgtc tccttcttgg atccacggca gccatcatac 1260 
aacgtcaagt ctgtctgttc cagggagcga ggcagtggaa tccggtcagt gagtttctac 1320 
gagcacatca tcactgtggg aacagggcag ggctccctgc tgttctatga catccgagct 1380 
cagagatttc tggaagagag gctctcagct tgttatgggt ccaagcccag actagcaggg 1440 
gagaatctga aactaaccac tggcaaaggc tggctgaatc atgatgaaac ctggaggaat 1500 
tacttttcag acattgactt cttccccaat gctgtttaca cccactgcta cgactcgtct 1560 
ggaacgaaac tctttgtggc aggaggtccc ctcccttcag ggctccatgg aaactatgct 1620 
gggctctgga gttaatgaca actccccaaa tgcagagatt tacactaact tccattctca 1680 
gtttccttgt ttcttttgat tttttttttc ctaattgtgt gaggctcttg tgttttagtg 1740 
ggaacaccaa agtttgccta tagtttaggc acttaatagg aagaagctct gtacagaaat 1800 
ctgaaagttg ttttgctttt tgttttcccc tttggtaatc aaaattttac tatcttttat 1860 
tatttctggc ttttcaacca aacattgttg ctaatcccta tttttcttta agtgacacac 1920 
attctcctgt ctctggcttc ttcaggctga aatgacatag tctttctcac ccttacttca 1980 
ctcttgagag gtagggctcc tttataatta catggttgct ctcagacttt ctgtgaaagt 2040 
ttgggagctg tgtgtgtctg tgtgtgtgtg agagagagat cttgtctgcg tgtgtgtgtg 2100 
tgatcttgtg tgcctgtagg tactgtgtgt cactgaaatt acctggagtg aggattactt 2160 
gtaattaaaa tatttataaa agaaacaact ttattcacag agtccagctt tgggactagt 2220
ctgtatcttg ttttttaagt ctaacaacac tgataatagg aagtaaaaac agaaaggaaa 2280
agaaattacc actgggaaaa tctttttagt tagattgtag gcttcctggg gcctcccatg 2340
ccaggactgc aaagtgatcc agccctacct gtcttcccac ctgtgtgtcc cccgtgtggg 2400
aagttggtgt cacttcccct tcccaccctc acatctgctt agccagtagc cacaccccta 2460 
aaacatcaga ctcaccatcc aggtgcagct ccagaggcta caaaaggctt catgggactt 2520 
gaatccccat cctagcttct ctctccttcc cctcaagacc tgatctggtt ttaaggggcc 2580 
tggagctggg agtctcaagt ctgctaagat tcacatccat agcccccgtg gctttgagga 2640 
gaatcctctc tgccattctt ccaatctccc cagtgggttt tgctattatt ttctaaattg 2700
ggttaagtct aagaaggtgg gggtgagcag ggggtttatc tgtgtgtagt gagtgcttca 2760 
tgtgtggaat attcattttc ttactgcagt gggacttggg gttgaagcca cccctcctac 2820 
tctgttggct tagccctgag atggtgacag gctggcctgc agtcagcatc attgtgcatg 2880 
tgacagcatc aatgtgatta gtaatttgtc tgttcctccc ttgaactgtc tgtttagtct 2940 
gaggttttta aacttgcagg cagctgactg tgatgtccac ttgttccctg atttttacac 3000 
atcatgtcaa agataacagc tgttcccacc caccagttcc tctaagcaca tactctgctt 3060 
ttctgtcaac atcccatttt ggggaaagga aaagtcatat ttattcctgc accccagttt 3120
tttaacttgt tctcccagtt gtccccctct tctctgggtg taagaaggga aattggaaaa 3180 
aaaattatat atatattctc cttttaatgg tggggggcta ctggagagga gagacagcaa 3240 
gtccacccta acttgttaca cagcacatac cacaggttct ggaattctca tcttcgaacc 3300 
tagagaaata ggtgctataa acagggaatt aagcaaaatg ctggatgcta tagatctttt 3360 
aattgtctta attttttttc tattattaaa ctacaggctg tagatttctt agttctcaca 3420 
gaacttctat cattttaaac tgacttgtat atttaaaaaa aaaatcttca gtaggatgtt 3480 
ttgtactatt gctagaccct cttctgtaat gggtaatgcg tttgattgtt tgagattttc 3540 
tgtttttaaa aatgtagcac ttgacttttt gccaaggaaa aaaataaaaa ttattccagt 3600
gcaaaaaa 3608

<210> 100 

<211> 1311 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2590967CB1 

<400> 100 

ggcaggatga acgctgcttt ccaagatggc gacggaggga ggagggaagg agatgaacga 60
gattaagacc caattcacca cccgggaagg tctgtacaag ctgctgccgc actcggagta 120
cagccggccc aaccgggtgc ccttcaactc gcagggatcc aaccctgtcc gcgtctcctt 180
cgtaaacctc aacgaccagt ctggcaacgg cgaccgcctc tgcttcaatg tgggccggga 240
gctgtacttc tatatctaca agggggtccg caaggctgct gacttgagta aaccaataga 300
taaaaggata tacaaaggaa cacagcctac ttgtcatgac ttcaaccacc taacagccac 360 
agcagaaagt gtctctctcc tagtgggctt ttccgcaggc caagtccagc ttatagaccc 420 
aatcaaaaaa gaaactagca aactttttaa tgaggaaggc tcattgtcat ccccaagcca 480 
ggccagttct ccaggtggaa ctgtagtgta gcgacctcac tgctgcgcgc acagtctccc 540 
gggacttgga ctcgagggag tgacgaggag gagctccgag ctgcgcctga gccgtgccag 600 
ccggcggacc tcaggcggtg gacgtcggcg atagccgtgt ggacggtgac cggctcactc 660 
tgcggcgccg tgctcccgct gctcacccaa agaagttgtt tccattttaa accggtcttt 720 
tggggctgca gtaaaaaata agaaatggag ttttcttgct ttttactcta aaattcaatg 780
taattaaatt tcatatatat ataatatata catatataca tagtgtaaaa taaaatgttt 840 
cttggacaag aaatcccctg aaattcagct gttatagtgc ttcactgttt ttgcactgat 900 
ttttctatac cttaggtggt cagaagacaa ccttgaatgc actcatagag aaaactgtta 960 
ctttctgacg taatgtaatt caggaagaca gacgctgcaa tcacagattt taaaaaattg 1020
tttgcactta aaaatagttg aatgctggtg gaaagttact ttgcagatgg gtgtaaggac 1080 
tcatggccct ctgaggtgcg gcgtgaagat gcccttttta cccgttgacg tttattttac 1140 
gtaaaataaa ctgttgtttc caatgcaatc aactctgtat tatatgtata aatattgtaa 1200 
ttctgcaatt ggggaaaata gttacttcac tagtaatttt catcatttaa gagtgatatt 1260 
tctaattcac aaaagttaat attaaaacta tcttgaatat aaaaaaaaaa a 1311 

<210> 101 

<211> 2839 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2824491CB1 

<400> 101 

ggcctgcggg aagccaagat ggcgcatagg ggttctccag gctgcagttg gcgccttatc 60 
agtatctaag cggagtgttt tggaaggagt taaggggctg tggcaaacgc cctctccgcc 120 
gtcatggccc ggcatcggaa tgttcgaggc tataactacg atgaagattt tgaagatgat 180 
gatctctacg gccagtctgt agaggatgat tattgtattt cgccgtcaac agctgctcag 240 
tttatttatt cacggcgtga caaaccttcc gttgagcctg tggaagaata tgattatgaa 300 
gatctgaaag aatcttccaa ttctgtttca aaccatcagc tcagtggatt tgatcaagct 360 
cgtctttatt catgccttga tcacatgaga gaggtacttg gagatgctgt gccagatgaa 420 
atattaattg aagcagttct gaagaacaag tttgatgtgc agaaggcttt gtcaggggtt 480 
ctggaacaag atagagtgca gagtttgaag gacaagaatg aggcaacagt atctacagga 540 
aagatagcaa aaggaaaacc agtagattcc cagacatcgc gaagtgaatc tgaaattgtg 600 
ccaaaagttg ctaaaatgac tgtatctgga aagaagcaaa ctatgggatt tgaagtgcct 660 
ggagtatctt ctgaagaaaa tggtcatagt ttccacacac ctcaaaaagg accgcccatt 720 
gaagatgcca ttgcttcttc cgatgttctt gagactgctt ctaaatctgc taatccaccc 780 
cacacgattc aagcatcaga agagcagagt tcaaccccag caccggtgaa aaagtctggc 840 
aagctgaggc agcaaataga tgtgaaggcg gaactggaga agcggcaagg agggaagcag 900 
ctactcaact tagtggtcat tggtcatgtt gatgctggga aaagtactct gatgggccat 960 
atgctttatc ttctgggtaa tataaacaaa agaactatgc ataagtatga acaggagtct 1020 
aaaaaggctg gcaaagcttc gtttgcatat gcatgggtct tggatgaaac tggcgaagaa 1080 
agggaaaggg gagtaaccat ggatgttggt atgacaaagt ttgaaaccac aaccaaagtt 1140 
attacattaa tggatgctcc aggccataag gacttcattc caaatatgat tacaggagca 1200 
gcccaggcgg atgtagctgt tttagttgta gatgccagca ggggagagtt tgaagctgga 1260 
tttgagactg gaggacaaac acgagagcat ggactcttgg tccgttctct gggagtgacg 1320 
cagcttgcag ttgcagttaa taaaatggat caggttaatt ggcaacaaga aaggtttcaa 1380 
gagattactg gaaaacttgg gcactttctt aagcaagcag gttttaagga gagtgatgta 1440 
ggttttattc ctacaagtgg tctcagtggt gaaaatctaa tcacaagatc tcagtcaagt 1500 
gaactcacaa aatggtataa aggactatgt ttattagaac aaattgattc ctttaagcct 1560 
ccccagcgat ctattgacaa accttttaga ttatgtgtgt ccgatgtttt caaagatcaa 1620 
ggatctggat tttgcataac tggtaaaata gaagctggtt atatccaaac tggtgaccga 1680 
ctactggcaa tgcctcctaa tgaaacttgt accgtgaaag gaatcactct gcatgatgaa 1740 
cctgtcgact gggcggcagc aggcgatcat gttagtctta ctttggttgg gatggatatc 1800 
atcaaaatca atgttggctg catattttgt ggccccaaag tacccattaa agcttgcact 1860 
cgtttcagag cccgaatcct catctttaat attgaaattc ctatcactaa aggatttcct 1920 
gtgctgttac actaccaaac tgtcagtgaa cccgccgtta ttaaacgatt gattagtgtc 1980 
ttaaacaaaa gcacgggtga agtcacaaag aaaaagccta agtttttgac taaaggccag 2040
aatgcattgg tagagctaca gacacaaaga ccaatagctc ttgagctata taaagacttt 2100
aaagagctgg ggaggttcat gctacgttac ggtggttcta caatagctgc tggtgttgtc 2160 
actgagataa aagaatgatg ggtcagaatt tctaccacgt ttctggatac agtgaaatag 2220 
ctaacctctg tttcaagaat gcagttatta agtcaaagga acaatgtgca attgatatgt 2280 
ttttagatga gagagaaaaa ttaaagctaa aattagctgc aaagaagtat taataatcac 2340 
ctctgcaaaa attctaagtt gccaactggc aaagaaagtc taatgttaaa aacaactttg 2400 
cctttgaaac gttaataaat ggatttactt tgctaagatt tatggcaagt gtcaaaaata 2460 
gtatctgaag atactgaatc atcatgaaat gaactctact tctggccaaa gcacaatgta 2520 
tttgcagttt tctcttttga ttcaattata ctgcacatgt tttaaggaaa agtaacttaa 2580 
ttgggttttt caggcagttg atatttgacc taagcttttt tttttttttt tttttttttt 2640 
tccagttaat gctaagaaaa gatttgggga aggttataat aaaagtattt tgtggtgacc 2700 
ataagaatgt ccctccccaa acaagtaaac ttgtgaaagt ttaatttgga attagtggaa 2760 
gctgttcctt tgaaagccaa gatattattt aagttgtaaa gccagctaat aaaatgcctt 2820
agtttgagca taaaaaaaa 2839 

<210> 102 

<211> 1676 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2825460CB1 

<400> 102 

gggaggcgga ggttgaggtg agaggagatc gcgccattgc actccagcct gggcaacaag 60 
ggcgaagttc tctcaaaaga aaaaaaaaaa tcgtcggttt ccttttccca tctttctttc 120 
gtgacttata ttccagaacg aagccccaca catcattaat gatttgaatc gtttctaaag 180 
tgtttcttaa atcgtttctt aaatcgtttg ttgtttcttg tctaacagtc cagaacacat 240 
attacataat ggagccggga gacagactag ggctggcttg atccggccac gcagtccagg 300 
aaaggtgctt ttcaccccca agtgcaaaat gatcaatgta ttcttccgat ctacataaac 360
aagcacctcc tggtttcatt ttcgtaaagc aaaacaagca tggaagcttt actgtttcgg 420 
ctcttcaaac ttccagcaac tacactgcgc tgcatcggac tcgacgcccg ctggtgacgc 480 
acacgctgcg ccggaagtgt gaactgtctg cctccaggct ttgtcatggc ggctgctgct 540 
gcacgctgga accatgtgtg ggtcggcacc gagactggga tcttgaaagg ggtaaatctt 600 
cagcgaaaac aggcggcgaa cttcacggcc ggaggacagc cgcggcgcga ggaggcagtg 660 
agcgccctgt gttggggcac cggcggcgag acccagatgc tggtgggctg cgcggacagg 720 
acggtgaagc acttcagcac cgaggatggc atattccagg gtcagagaca ctgcccgggc 780 
ggggagggca tgttccgtgg cctcgcccag gccgacggca ccctcatcac atgtgtggat 840 
tctgggattc tcagagtctg gcatgacaag gacaaggaca catcctctga cccactcctg 900 
gaactgagag tgggccctgg ggtgtgtagg atgcgccaag acccagcaca cccccatgtg 960 
gttgccacag gtgggaaaga gaatgctttg aagatatggg acctgcaggg ctctgaggaa 1020 
cctgtgttca gggccaagaa cgtgcggaat gactggctgg acttgcgggt tcccatctgg 1080 
gaccaggaca tacagtttct cccaggatca cagaagcttg tcacctgcac agggtaccac 1140 
caggtccgtg tttatgatcc agcatccccc cagcgccggc cagtcctaga gaccacctat 1200 
ggagagtacc cactaacagc catgaccctc actccgggag gcaactcagt gattgtggga 1260 
aacactcatg ggcagctggc agaaattgac cttcggcaag ggcgtctact gggctgtctg 1320 
aaggggctgg caggcagtgt gcgtgggttg cagtgccacc cttcaaagcc tctactagcc 1380
tcctgtggct tggacagagt cttgaggata cacaggatcc agaatccacg gggtctggag 1440 
cataaggatg agccccaaga gcctcaagaa cccaacaagg tgcccctaga agacacagag 1500 
acagatgaac tttgggcatc cttggaggca gctgccaagc ggaagctctc gggtttggag 1560 
cagccccaag gagctctcca aacgagacgg agaaagaaga agcggcctgg gtccaccagc 1620 
ccctgacgcc cctgtgccca ctttgtaaat aaactgctga acacccaaaa aaaaaa 1676

<210> 103 

<211> 3206 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2871116CB1

<400> 103

ccagagcgtg cgttcggtgg cccatagggg aagatggcgg ctgctccttt ggaggagcgg 60
gattgagagg atcggggtgg ggagaccaaa caagagagac atttctggct ctgaaggcga 120
acgcttcgct ggccatttag gagctctgct caaagccaga cgtatcctag aaggaaaaca 180
tcaccatggc tacagaaatt ggttctcctc ctcgtttttt ccatatgcca aggttccagc 240
accaggcacc tcgacagctg ttttataagc gacctgattt tgcacaacag caagcaatgc 300
aacagcttac ttttgatgga aaacgaatga gaaaagctgt gaaccgaaaa accatagact 360
acaatccatc tgtaattaag tatttggaga acagaatatg gcaaagagac cagagagata 420
tgcgggcaat tcagcctgat gcaggttatt acaatgatct ggtcccacct ataggaatgt 480
tgaataatcc tatgaatgca gtaacaacaa aatttgttcg gacatcaaca aataaagtaa 540
agtgtcctgt atttgttgtt aggctgcagg aagagtttga aagcctcagt gtccttaaat 600
cgtggactcc agaaggaaga cgcttggtca ctggagcttc tagtggggag tttaccctgt 660
ggaatggact cactttcaat tttgaaacaa tattacaggc tcacgacagc ccagtgaggg 720
ccatgacgtg gtcacataat gacatgtgga tgttgacagc agaccacgga ggatatgtga 780
aatattggca gtcgaacatg aacaacgtca agatgttcca ggcacataag gaggcgatta 840
gagaggccag gtttatacac aatataccat tttctgtagt ccctattgtc atggttaaat 900
tattctctaa gtgtattctg ggtgcagaga tgcatgggct ctgtcagttt ctgggaaact 960
ttctgcaccc tataaacaca atatttttct ttgttttcac acattcacca ttttgctggc 1020
acctttctga agtagtgttg tcccggtatc agcctttgca atatgttaga gatgtactgt 1080
ctgccgcatt ttgcactggt tttctctttt catttatgat taataatgtg tatacgttat 1140
tcctttttat tatctactgt gtaagacaag aatatttcat tccaaataaa gaattcagtc 1200
tttaattatg caactgaata aaatctaaag cctacagaaa acaacttcag aattcacaca 1260
aagtggaaaa aggcttaagt gaagacctgg ttggcttggt tatgccacga cttccaaagg 1320
aaagtatagg actaaaaccc tcacagataa ctggatgtgg caaacattaa cggagtaatg 1380
aatgggttct tcaagctttg cagctgtaag cagatcattg tcaagaagac tctaggactt 1440
ttcttctgat tcactgttga taacatcact tatgcaaatg tatacaataa gtggagttta 1500
aaatattttc agtgagttgt atatttttac acatcagtga ggtatgtata gtaaaactgg 1560
gggaaaaagt tccaaataca agcctgaaga attgctgcag cctcagaata aagctaagca 1620
gcattcttta aggttgtgcc acccatgtgt gggaggaggt tgacatcttt atggaaacat 1680
catccactgt agtcatttgt tcatactttc agaatcttaa cagaaattgt tggatgaaca 1740
tgcttctgct ttgtagattt tgccttagtg tcatgcccat acattgagtt tacacagctg 1800
gtccttcata ggattccaaa gttcaaggga gtttttagag ttagttgaga aacttgatga 1860
tctttcactg ctgggaaaaa ctgactcctt cttgcagcag attctttggc tttacacaca 1920
agtctgaatg tccttatttt aaagttttcc tcaaaggtgc aacattcatg gaatagcttg 1980
ccaggaagat gtgaaacttt tctacagacc tttgaaatgg atgagaaaca ttgtatgtag 2040
ggatgtttag caatcagtct tttaatagac agcccacatt gtttcagctt atttcatgaa 2100
gtgtctgagg cagaagctga tgataatttt gggagcagta ttcgtgtgtg atttaaaaga 2160
ctgcaggaat actgcaaaaa tagaatccat ttattttcac cacttaaggc agcttcatgt 2220
gatttcctcg tatcatagaa aatagagaag gaacatggat agcattagca ctaataatac 2280
acacttgaag ttctcagaat actgatgatt gaaaactcaa acaactgctc tgttgaagtc 2340
ttcttttgat gagatgccta tgttagctga cgacattcac tttaagggct tcttcactgg 2400
attcttccct ctcctgttta taatgcagca cagtgttttt atttttccct gtctgagaag 2460
cacagattat ctgttaaatg ctgacttctt tcccctgctg tgtgtcttca tgtaacagtt 2520
tctcacccac ggataataaa tttgctacat gctctgatga cggcactgtt agaatctggg 2580
actttcttcg ttgccatgag gaaagaattc tccgaggtac gtgtactaac agtactgatt 2640
ggaatattta aatagggaag acatttgtgg ttaaatcatc acaaaaccac aatactggct 2700
tacacctcca ttcaattttt tttacatata cacaccgtct caggctcttc aaaaaaaccc 2760
agcactttct ctgactcaca gtcattttgt aggtttttac taccagtgtt atctttgaat 2820
ttttcagctg taaattaaat acaagagtgc ctccccctta cttgcttatc tgtatgcatc 2880
ttttagggct gtattccttt tccttccttg tagccagggt acttgttccc aacatattga 2940
cactgtggtt tgatttagat agccgtcatt ctcctggcag tccttttaca atatgaatta 3000
accgacaaga tagaggtatc aaagctacac ttcttagtgt tactattttt gaaagcagtt 3060
ggtttttcag tacaccacat ttgtactaca tggccggctt gttactaagt tcgggtggca 3120
ttgctgcttg tttacttttg ttgattttat aattaataaa cctctatgaa attacttcat 3180
tccgtaactg aaaaaaaaaa aaaaaa 3206

<210> 104

<211> 921

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2942212CB1

<400> 104

ggtgctgatg ctgctgccat ttcatcacct ttgcgagcgc acatccatcc ctccgctctc 60 
ccggcgcctg ggcctaccca gcttcgggct cccaggccag cgatgcgctc gcggctgagc 120 
tagatcctgc cgagccgcgc tctctgaggc gtcggcgggg cgccccctcc cgccgtcccc 180 
ggtccgggcc aaggagacct gcagagccgc ggccatggag gccatctggc tgtaccagtt 240 
ccggctcatt gtcatcgggg attccacagt gggcaagtcc tgcctgatcc gccgcttcac 300 
cgagggtcgc tttgcccagg tttctgaccc caccgtgggg gtggattttt tctcccgctt 360
ggtggagatc gagccaggaa aacgcatcaa gctccagatc tgggataccg cgggtcaaga 420 
gaggttcaga tccatcactc gcgcctacta caggaactca gtaggtggtc ttctcttatt 480 
tgccattacc aaccgcaggt ccttccagaa tgtccatgag tggttagaag agaccaaagt 540 
acacgttcag ccctaccaaa ttgtatttgt tctggtgggt cacaagtgtg acctggatac 600 
acagaggcaa gtgactcgcc acgaggccga gaaactggct gctgcatacg gcatgaagta 660 
cattgaaacg tcagcccgag atgccattaa tgtggagaaa gccttcacag acctgacaag 720 
agacatatat gagctggtta aaagggggga gattacaatc caggagggct gggaaggggt 780 
gaagagtgga tttgtaccaa atgtggttca ctcttcagaa gaggttgtca aatcagagag 840 
gagatgtttg tgctagtcag ttcttttatt tccaaaacat gctctcctac ttgaactgaa 900 
aagtaagaga aataaataga a 921 

<210> 105 

<211> 1367 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3685151CB1 

<400> 105 

aagaggcacg tgcgctgctg aatggagctg gtcgctggtt gctacgagca ggtcctcttt 60 
gggttcgctg tacacccgga gcccgaggct tgcggcgacc acgagcagca atggactctt 120 
gtggctgact tcactcacca tgctcacact gcctccttgt cagcagtagc tgtaaatagt 180 
cgttttgtgg tcactgggag caaagatgaa acaattcaca tttatgacat gaaaaagaag 240 
attgagcatg gggctctagt gcatcacagt ggtacaataa cttgcctgac attctatggc 300 
aacaggcatt taatcagtgg agcggaagat ggactcatct gtatctggga tgcaaagaaa 360 
tgggaatccc tgacgtcaat taaagctcac aaaggacagg tgaccttcct ttctattcac 420 
ccatctggca agttggccct gtcggttggt acagataaaa ctttaagaac gtggaatctt 480 
gtagaaggaa gatcagcatt cataaaaaat ataaaacaaa atgctcacat agtagaatgg 540 
tccccaagag gagagcagta tgtagttatc atacagaata aaatagacat ctatcagctt 600 
gacactgcat ccattagtgg caccatcaca aatgaaaaga gaatttcctc tgttaaattt 660 
ctttcagagt ctgtccttgc agtggctgga gatgaagaag ttataaggtt ttttgactgt 720 
gattcactag tgtgcctctg cgaatttaaa gctcatgaaa acagggtaaa ggacatgttc 780 
agttttgaaa ttccagagca tcatgttatt gtttcagcat cgagtgatgg tttcatcaaa 840 
atgtggaagc ttaagcagga taagaaagtt cccccatctt tactctgtga aataaacact 900 
aatgccaggc tgacgtgtct tggagtgtgg ctagacaaag tggcagacat gaaagaaagc 960 
cttcctccag ctgcagagcc ttctcctgta agtaaagaac agtccaaaat tggcaaaaag 1020 
gagcctggtg acacagtgca caaagaagaa aagcggtcaa aacctaacac aaagaaacgc 1080 
ggtttaacag gtgacagtaa gaaagcaaca aaagaaagtg gcctgatatc aaccaagaag 1140 
aggaaaatgg tagaaatgtt ggaaaagaag aggaaaaaga agaaaataaa aacaatgcag 1200 
tgaatcacag atgtctcctg aaagaactct tttagatgaa atcattctac tcaaatgtac 1260 
cttaattttt tttttttccc tgagtaaaag caagaaattt cttcctttgg aaaaaatata 1320 
tatattaaaa aaccactttt agatggtttt ttttaaaaaa aaaaaaa 1367 

<210> 106 

<211> 1560 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4881515CB1 
<400> 106 

aggcggactg gggaggcggc ggcctggctc ggcctggcct ggcctgtcag ggcgcgggcg 60 
gcggcggctc cagcaccatg tccctgcagt acggggcgga ggagacgccc ctcgccggca 120 
gttacggcgc ggccgattcg tttccaaagg acttcggcta cggcgtggag gaggaggaag 180 
aggaggcggc ggcggcgggc ggaggggttg gggcaggggc aggcggtggc tgtggtccgg 240 
ggggcgctga cagctccaag ccgaggattc tgctcatggg actccggcgc agcggcaagt 300 
cctccatcca gaaggtggtg tttcataaga tgtcacccaa cgagaccctc tttttggaaa 360 
gtaccaacaa gatttataag gatgacattt ccaatagctc ctttgtgaat ttccagatat 420 
gggattttcc tgggcaaatg gacttttttg acccaacctt tgactatgag atgatcttca 480 
ggggaacagg agcattgata tacgtcattg acgcacagga tgactacatg gaggctttaa 540 
caagacttca cattactgtt tctaaagcct acaaagttaa cccagacatg aattttgagg 600 
tttttattca caaagttgat ggtctgtctg atgatcacaa aatagaaaca cagagggaca 660 
ttcatcaaag ggccaatgat gaccttgcag atgctgggct agaaaaactc catcttagct 720 
tttatctgac tagtatctat gaccattcaa tatttgaagc ctttagtaag gtggtgcaga 780 
aactcattcc acaactgccg accttggaaa acctattaaa tatctttata tcaaattcag 840 
gtattgaaaa agcttttctc tttgatgttg tcagcaaaat ctacattgca acagacagtt 900 
cccctgtgga tatgcaatct tatgaacttt gctgtgacat gatcgatgtt gtaattgatg 960 
tgtcttgtat atatgggtta aaggaagatg gaagtggaag tgcttatgac aaagaatcta 1020 
tggcaattat caagctgaat aatacaactg tcctttattt aaaggaggtg actaaatttt 1080 
tggcactggt ctgcattcta agggaagaaa gctttgaaag aaaaggttta atagactaca 1140 
acttccactg tttccgaaaa gctattcatg aggtttttga ggtgggtgtg acttctcaca 1200 
ggagctgtgg tcaccagact agtgcctcca gtctgaaagc gctgacacac aatggcacgc 1260 
cacgaaacgc catctagtct gaatcccagc gtcggggctc tgtgccagct tactcttcac 1320 
tccagggtcg gatgccacgt gctacaggac atgggagctg ctgcttgtgg gaatctggtg 1380 
cctgttccac tagagacaag gggtagagtt tctcatttgg atgaaaaccc cttcaactgg 1440 
tggtgtacaa ctgaagctac tatatctttt ttgaaaatgg caaaaaaaaa aaaaaaaaat 1500 
tctggagacc acagaactca agtgtgtgtt tctcctcttt tgggtcccct ttaagtagtt 1560 



<210> 107 

<211> 1495 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5324681CB1 



<400> 107 

gaagaggctc tgggctggca catgtgtatg gcggtgaggc gggcgggtac atggcgggct 60 
ctgtgggact ggcgttgtgc gggcagacgt tggtggtgcg gggcggcagc cgattcctgg 120 
ccacctccat agcaagcagt gatgatgaca gcctcttcat ctatgactgc agtgctgcag 180 
aaaagaagtc acaagaaaat aaaggggagg acgcgccctt ggaccagggg agcggtgcga 240 
ttctggcgtc caccttctcc aagtctggca gctattttgc tttaaccgat gacagtaagc 300 
gtctgattct tttccgtaca aaaccatggc aatgtctgag tgtcaggacc gtggcaagga 360 
ggtgtacagc cctgactttc atagcctcgg aggagaaggt cttggtggcc gacaagtctg 420 
gagacgtcta ctccttttcg gtgctggagc cacacgggtg tggccgtcta gagctggggc 480 
acctgtctat gctgttagat gtggctgtga gtcctgatga ccgcttcatc ctcactgccg 540 
accgggacga gaagatccga gtcagctggg ccgcggcgcc ccatagcatc gagtccttct 600 
gcttggggca cacagagttt gtgagccgta tctccgtggt gccaactcag cccgggctgc 660 
ttctgtcctc ctctggggac ggcaccctga ggctctggga gtacaggagc ggccgccagc 720 
tgcactgctg tcacctggcc agtctgcagg agctggtgga cccccaggcc ccccagaagt 780 
ttgccgcgtc caggattgca ttctggtgcc aggagaactg cgtggcgctc ctgtgcgacg 840 
gcactcctgt ggtctacatc ttccagctgg acgcccgcag acagcagttg gtgtacaggc 900 
agcagctggc gttccagcac caagtgtggg acgtggcttt cgaggagacc caggggctgt 960 
gggtgctcca ggactgccag gaagcccccc tggtgctcta caggcctgtg ggcgaccagt 1020 
ggcagtctgt tcctgagagc accgtgttaa agaaagtctc tggtgttctt cgtgggaact 1080 
gggccatgct ggaaggctct gccggcgcag acgccagctt cagcagtctc tacaaggcca 1140 
cgttcgacaa cgtgacctcc tacctgaaga agaaagagga gagactgcag cagcagctag 1200 
agaagaagca gcggcgccgg agtcccccgc ctgggcccga cgggcatgcc aagaagatga 1260 
gaccggggga ggcgacgcta agttgctgat cgtggcggtc tgtttctgtc gactgtggac 1320 
cacttatgtg cgatccgtgg accacttgcg tgcgatctgt cggccgacga tgagcttgtt 1380 
cggatgtagc tccatcgtaa gtcgaggagc atctgtgatt tgtcctctgc ttatgggata 1440 
tgtttttccg ctactgagtc tgtgtagtaa atttttgact aggaaaaaaa aaaaa 1495 
<210> 108 
<211> 1919 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5387651CB1 



<400> 108 

cgccctgcat gcgagttggg ccgcgggcgg ggttggagcc tactcggggc gactgcgatg 60 
gacgccttag aaggagagag ctttgcgctg tctttctcct ccgcctctga tgcagaattt 120 
gatgctgtgg ttggatattt agaggacatt atcatggatg acgagttcca gttattacag 180 
agaaatttca tggacaagta ctacctggag tttgaagaca cagaagagaa taaactcatc 240 
tacacaccta tttttaatga atacatttct ttggtagaaa aatacattga agaacagctg 300 
ctgcagcgga ttcctgagtt caacatggca gccttcacca caacattaca gcaccataag 360 
gatgaagtgg ctggtgacat attcgacatg ctgctcacct tcacagattt tctggctttt 420 
aaagaaatgt ttttggacta cagagcagaa aaagaaggcc gaggactgga cttaagcagt 480 
ggcttagtgg tgacttcatt gtgcaaatca tcttctctgc cagcttccca gaacaatctg 540 
cggcactagg tcctacctcc agccaatgaa tgggatcatt ctggatgtca ccagcccaat 600 
aggctcagct catgatgaca gaacacatct tggaaagact gactctgtta tgtaactctt 660 
catttatgtt aagtattaat aggtcaaaac caaaatgacc taaccctcct ggacctattt 720 
atcctgaaac accttcttgt attcattaac catagtactc ctccccacct caagtagaca 780 
cctctctcag gagcttctga gtcagacgcc tctggagcga gccctatgtc aggcactcca 840 
cctggggggc ccttccccag catacctgct ggtgtgtaag tgtggactaa cccgccgcca 900 
ccaccctctg ttccagcagg ctctgcatga atctttgtgc acttgcacct ctttttcaca 960 
tgggccacag tttcagtact tcagcctcag tggggttcct gatgtttatc tagggtgtta 1020 
ctcaagccca gtttgagatt ttggagtctc ctgtgatcac atcttgtctc ggctgtagga 1080 
atcaacagaa ggagacgtcc tctacataaa agctccatgt gaaaagctac tcctagtctt 1140 
aacatttgca gtccttgtgt cactgtcttc tggtcctgat gtagtcccac tgtttctaga 1200 
agtctctttt aagcattatt tttgaaaaaa aaaatatttt tatagatgaa tactcaggct 1260 
aacctagtgg atgtgatctt ggaacttcca tgattatcca cttaaagatc aaagtattat 1320 
atgctgtgtg ctttttaggt gtttgttagt actgtgaagg caaaaatgct ttctacattg 1380 
acattcattc ctattttact gggcacctat gaatgtatgc tgtgtgctag aaatagacta 1440 
aaacatattc ctatagcatg ttagtgtgtt tgcatgtttg ctgaaaatcc tttgtgtata 1500 
aaccagtttg taaggttctc tgggttaggt agggactctg cagtttcttc ctgtcaaaat 1560 
ctctcctacc aagatggtgt tccactgtcc agcccagcat gagtagcagg tagagcacag 1620 
ctttactggc tgtttgtatg ctttggttta gtgcaatgtg tggtagatta cttatcagaa 1680 
aacatatatg tcatctctag aacgaagaaa aagcatagta gttcaattcc cagtgtgtcc 1740 
ctttgatttt ttttttttaa tagtaaaaat aagaatctgt actgactttt cacttggcca 1800 
ttctggtttt aaaggacaag ctacaagctc tgtgtttctg tactgatgtg tcacttatta 1860 
aatacttttg taccatgagt aaaacttcag gtgtttcgca agaaccacca ttctcaaaa 1919 

<210> 109 
<211> 2941 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5595679CB1 



<400> 109 

attaggctaa taggacagca cttgaatggc ttagggctca accagactgt tgatctcctc 60 
atgcaagagt caggatgtcg tttagaacat ccttctgcta ccaaattccg aaatcatgtc 120 
atggaaggag actgggataa ggcagaaaat gacctgaatg aactaaagcc tttagtgcat 180 
tctcctcatg ctattgtggt aagaggcgca cttgaaatct ctcaaacgtt gttgggaata 240 
attgtgagga tgaagttttt gctgctgcag cagaagtacc tagaatacct ggaggatggc 300 
aaggtcctgg aggcacttca agttctacgc tgtgaattga cgccgctgaa atacaataca 360 
gagcgcattc atgttcttag tgggtatctg atgtgtagcc atgcagaaga cctacgtgca 420 
aaagcagaat gggaaggcaa agggacagct tcccgatcta aactattgga taaacttcag 480 
acctatttac caccatcagt gatgcttccc ccacggcgtt tacagactct cctgcggcag 540 
gcggtggaac tacaaaggga tcggtgccta tatcacaata ccaaacttga taataatcta 600 
gattctgtgt ctctgcttat agaccatgtt tgtagtagga ggcagttccc atgttatacg 660 
cagcagatac ttacggagca ttgtaatgaa gtgtggttct gtaaattctc taatgatggc 720 
actaaactag caacaggatc aaaagataca acagttatca tatggcaagt tgatccggat 780 
acacacctgc taaaactgct taaaacatta gaaggacatg cttatggcgt ttcttatatt 840 
gcatggagtc cagatgacaa ctatcttgtt gcttgtggcc cagatgactg ctctgagctt 900 
tggctttgga atgtacaaac aggagaacta aggacaaaaa tgagccagtc tcatgaagac 960 
agtttgacaa gtgtggcttg gaatccagat gggaagcgct ttgtgactgg aggtcagcgt 1020 
gggcagttct atcagtgtga cttagatggt aatctccttg actcctggga aggggtaaga 1080 
gtgcaatgcc tttggtgctt gagtgatgga aagactgttc tggcatcaga tacacaccag 1140 
cgaattcggg gctataactt cgaggacctt acagatagga acatagtaca agaagatcat 1200 
cctattatgt cttttactat ttcaaaaaat ggccgattag ctttgttaaa tgtagcaact 1260 
cagggagttc atttatggga cttgcaagac agagttttag taagaaagta tcaaggtgtt 1320 
acacaagggt tttatacaat tcattcatgt tttggaggcc ataatgaaga cttcatcgct 1380 
agtggcagtg aagatcacaa ggtttacatc tggcacaaac gtagtgaact gccaattgcg 1440 
gagctgacag ggcacacacg tacagtaaac tgtgtgagct ggaacccaca gattccatcc 1500 
atgatggcca gcgcctcaga tgatggcact gttagaatat ggggaccagc accttttata 1560 
gaccaccaga atattgaaga ggaatgcagt agcatggata gttgatggtg aatttggagc 1620 
agacgacttc tgtttaactt aaaattagtc gtattttaat ggcttgggat ttggtgcaaa 1680 
caaacatgat tgatagctgg acagacatgc tcgtcatgaa aaaagaacca tttctgaagc 1740 
ccgattgggg ccaaacattt acaccttgct tcatagtaac cagttgagat gaagcacgtc 1800 
gttagaacgt tgttggacac catgttgaat tattccccca tcggttgtga agaactgtgc 1860 
tacattcagg cttacccatt gaactcagta tatatatttt tttccttcct gtcttttgtc 1920 
tggcaggata ccattcttgt tgctcttctg tgtaatgaag tttaaatgct tgtttggaaa 1980 
actttattta acagtttaga aggcttgata gaaagagtgc attagtctga agagtataca 2040 
ttggatagga aagaatttcc ttcttttgtt tctccaaatc tttccgcctt atttagcttg 2100 
agatctttgc agcttggttc atggattcta gccttgcccg ttgcgcagta tatactgatc 2160 
cagatgataa accagtgaac tatgtcaaaa gcactctcaa tattacattt gacaaaaagt 2220 
tttgtacttt tcacatagct tgttgccccg taaaagggtt aacagcacaa ttttttaaaa 2280 
ataaattaag aagtatttat aggattaaag tgacttcatt tgtatacatt tggaatctaa 2340 
accagcttaa aaacagtttc ctcaatgact tagatacaca gtataactga tgctcttctg 2400 
gaataccaca tgagacatgg tcagaaacag tgcttggaag gacattacac aagaaattca 2460 
gagtaatgct ttgaagattt cccccctttt gttttattcc tgaaggaaca tcagtacccg 2520 
atcttgaaga aattcaagat tcaaaaagaa ttttaaatac accaacatga gacatcagta 2580 
gtcagttggt tttcagtaaa gcttgttcca agttgttctc aacttaggaa gtaattttgg 2640 
tgtgatctag caaaagagta ggaatcagcg atacaaccac tttggaagtt tatagtataa 2700 
ttgaaattat tagaagaatt cagcaggtta cagacatact taaactggga ttaaaacctc 2760 
atagtcattt ttcttaattg cccttaatat tttgacatat agggatacat aaatttaaag 2820 
aatatttttt ctcagttttt tcagatattg ccatactgaa cctcattcta aactggtgct 2880 
gtggatagtc tttccctctc ccctcctgtt ttagtttaag gaaaggtttc cttcatggaa 2940 
a 2941 

<210> 110 
<211> 710 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 5782457CB1 

<400> 110 

ctcgcggtcg cctggccgtt gtcattgtcc ctccgctgtc accttttcaa gccccaggct 60 
ggctgcttca gaagcccctg accccatgga ggagtgggac gtgccacaga tgaagaaaga 120 
ggtggagagc ctcaagtacc agctggcctt ccagcgggag atggcgtcca agaccatccc 180 
cgagctgctg aagtggatcg aggacgggat ccccaaggac cccttcctga accccgacct 240 
gatgaagaac aacccatggg tggaaaaggg caaatgcacc atcctgtgag ccccgcaccc 300 
ggcccctctc acaccatcct gtgagaccac gcccggcccc actcccacca tcttgtaaga 360 
ctgtgcccag ccccactcac tccatcctgt gagtcccact cccagcccca ctcccaccat 420 
cctgtgagcc catgcccggc cccactcaca ccaacctgtg agccccactc ccggccccac 480 
tcacaacatc ttgtaagact gtgcccggcc ccattcactc catcctgtga gaccacgccc 540 
ggccccactc actctatcct gtgagaccac gcctggcccc actcccacca tcctgtgagc 600 
cccactcctg gccccactca caccatccta tgagcccacg cccggcccca ctcccaccat 660 
cctgtgaacc ccactccact cgcacgtgat tacagtctgt aaaggtgtga 710 
<210> 111 

<211> 1351 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 760677CB1 

<400> 111 

ccaggcctag tggatagaca gggtccaaaa tgtgaccctt ctaggctggt atcaccatgg 60 
gggcgtcatg ggctgaggat tctgcagata ggacatcacc acggcagaga tggacagcct 120 
gagacaggag agaggtgtca gtctaggatg gccaggctgg gtccccccac cccttactca 180 
agagtcactt gttctgtagg gcaagtctca catgaagcta ctcatcattt gcttcgtgtc 240 
ctccgagctc cgagagttgg caaagctgat gaaggagcag tagacagcga cccaagcaca 300 
ccacttcagc tagggaaaga ccgatttaag gctgcaagga aggagtcctg ggagcatggc 360 
tttccctgag ccaaagccgc ggcctccaga gctgccgcag aaacggttga agacgctgga 420 
ctgcgggcag ggggcagtgc gagccgtacg atttaatgtg gatggcaatt actgcctgac 480 
gtgcggcagt gacaagacgc tgaagctgtg gaacccgctt cgggggacgc tgctgcggac 540 
gtacagcggc cacggctacg aggtgctgga tgcggccggc tcctttgaca acagtagtct 600 
ctgctccggc ggcggggaca aggcggtggt tctgtggaat gtggcatcag ggcaggtcgt 660 
gcgcaaattc cggggccacg cagggaaggt gaacacggtg cagtttagtg aagaggccac 720 
agttatcctg tccggctcta ttgattccag tatccgctgt tgggattgcc gctcacggag 780 
gcctgagcca gtgcagacgc tggatgaggc cagagatggc gtgtccagtg tgaaggtgtc 840 
agaccacgag atcctggcag gctccgtgga tggccgcgtg agacgctatg acctaaggat 900 
ggggcagctc ttctcagact acgtgggcag ccccatcacc tgcacctgct tcagccggga 960 
tgggcagtgc accctggtgt ccagcctgga ctccacattg cggctcctgg acaaagacac 1020 
aggggagctg ctgggcgagt acaagggcca taagaaccag gaatacaagc tggactgctg 1080 
cctgagcgag cgtgacacac atgtggtcag ctgttctgag gacgggaagg tgttcttctg 1140 
ggacctggtg gagggtgcgc tggctctggc cctgcctgtg ggttccggtg tggtgcagtc 1200 
gctggactac cacccaacag agccctgcct gctgaccgcc atgggaggca gcgtccagtg 1260 
ctggcgagag gaggcctatg aggcagagga tggagcaggc tgaagccagg ggacccacca 1320 
acaggaccaa ggaccgagac acagacatgg c 1351 

<210> 112 

<211> 1783 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1348567CB1 

<400> 112 

cccacgcgtc cgcggacgcg tgggctgaag gctgtggcgc gcggctgtcc ccattcccac 60 

gtgaagcgct acgctagcat cgctcggctg gcggctccca gctcgccgcg gagcagtccc 120 

ggcagcagcg ggggaccgga agtggctcgc ggaggctcag aagctagtcc cggagcccgg 180 

cgtgtggcgc ctcggagcgc ggtgacggcg ccatgtccct aatctgctcc atctctaacg 240 

aagtgccgga gcacccatgt gtatcccctg tctctaatca tgtttatgag cggcggctca 300 

tcgagaagta cattgcggag aatggtaccg accccatcaa caaccagcct ctctccgagg 360 

agcagctcat cgacatcaaa gttgctcacc caatccggcc caagcctccc tcagccacca 420 

gcatcccggc cattctgaaa gctttgcagg atgagtggga tgcagtcatg ccgcacagct 480 

tcactctgcg ccagcagctg cagacaaccc gccaagagct gtcacacgct ctgtaccagc 540 

acgatgccgc ctgccgtgtc attgcccgtc tcaccaagga agtcactgct gcccgagaag 600 

ctctggctac cctgaaacca caggctggcc tcattgtgcc ccaggctgtg ccaagttccc 660 

aaccaagtgt tgtgggtgcg ggtgagccaa tggatttggg tgagctggtg ggaatgaccc 720 

cagagattat tcagaagctt caagacaaag ccactgtgct aaccacggag cgcaagaaga 780 

gagggaagac tgtgcctgag gagctggtga agccagaaga gctcagcaaa taccggcagg 840 

tggcatccca cgtggggttg cacagtgcca gcattcctgg gatcctggcc ctggacctct 900 

gcccgtccga caccaacaag atcctcactg gtggggcgga taaaaatgtc gttgtgtttg 960 

acaaaagttc tgaacaaatc ctggctaccc tcaaaggcca taccaagaag gtcaccagcg 1020 

tggtgtttca cccttcccag gacctggtgt tttctgcttc ccccgatgcc actatcagga 1080 

tttggtcggt ccccaatgcc tcttgtgtac aggtggttcg ggcccatgag agtgctgtga 1140 
caggcctcag ccttcatgcc actggcgact atctcctgag ctcctccgat gatcagtact 1200 
gggctttctc tgacatccag acagggcgtg tgctcaccaa ggtgacagat gagacctccg 1260 
gctgctctct cacctgtgca cagttccacc ctgacggact catctttgga acaggaacca 1320 
tggactctca gatcaagatc tgggacttga aggaacgtac taatgtggcc aacttccctg 1380 
gccactcggg ccccatcact agcatcgcct tctctgagaa tggttactac ctggctacag 1440 
cggctgatga ctcctctgtc aagctctggg atctgcgcaa gcttaagaac tttaagactt 1500 
tgcagctgga taacaacttt gaggtaaagt cactgatctt tgaccagagt ggtacctacc 1560 
tggctcttgg gggcacggat gtccagatct acatctgcaa acaatggacg gagattcttc 1620 
actttacaga gcatagcggc ctgaccacag gggtggcctt cgggcatcac gccaagttca 1680 
tcgcttcaac aggcatggac agaagcctca agttctacag cctgtaggcc ctggcccttc 1740 
tgatggaagc tgggcctcat ctcagtagag gggtagaatt agg 1783 



<210> 113 

<211> 3453 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1751354CB1 



<400> 113 

ggcttgcgca ctacgtcccc agccagaggc tcctacccgg tcggggactt ccggaacgcc 60 
ggggtgtggt tccgggtcgt gtgcggctcg gggtaatagg gctgctgctc ggccggccgg 120 
cggcggcgag cagcaggggc atgagggcta acccgggaag cggcagctga gcgggccggg 180 
aggagcgccg gtccccgtgg atcccgagag tgcagagctc ggggcagggg ccgggaggcg 240 
tgggggagcc gggccctccc ctcaggaacg tgtcccgggg ccgacccggc ccgtagtgtg 300 
gaagcagctt caggtaggtg agctcgtgaa acaatatgaa gaggagaaaa tagcctttta 360 
aggaaattgg cccacagaaa ggatggcctt cttggacaat ccaactatca ttctagctca 420 
tattcgacag tcacatgtga ccagtgatga cacgggaatg tgtgagatgg ttctcattga 480 
tcatgatgtt gacctagaga agattcatcc tccttcaatg cctggagaca gtgggtcaga 540 
aattcaggga agcaatggtg agactcaggg ctatgtatat gcccagtcag tcgatattac 600 
ctcaagttgg gactttggta ttagaagacg ctcaaacaca gctcaaagat tagaacgact 660 
ccgaaaagag agacaaaacc agatcaaatg caaaaatatt cagtggaaag aaagaaattc 720 
taagcaatca gcccaggagt taaagtcact gtttgaaaaa aaatctctca aagagaagcc 780 
tccaatttct gggaagcagt cgatattatc tgtacgccta gaacagtgcc ctctgcagct 840 
gaataaccct tttaacgagt attccaaatt tgatggcaag ggtcatgtag gtacaacagc 900 
aaccaagaag atcgatgtct acctccctct gcactcgagc caggacagac tgctgccaat 960 
gaccgtggtg acaatggcca gcgccagggt gcaggacctg atcgggctca tctgctggca 1020 
gtatacaagc gaaggacggg agccgaagct caatgacaat gtcagtgcct actgcctgca 1080 
tattgctgag gatgatgggg aggtggacac cgatttcccc ccgctggatt ccaatgagcc 1140 
cattcataag tttggcttca gtactttggc cctggttgaa aagtactcat ctcctggtct 1200 
gacatccaaa gagtcactct ttgttcgaat aaatgctgct catggattct cccttattca 1260 
ggtggacaac acaaaggtta ccatgaagga aatcttactg aaggcagtga agcgaagaaa 1320 
aggatcccag aaagtttcag gccctcagta ccgcctggag aagcagagcg agcccaatgt 1380 
cgccgttgac ctggacagca ctttggagag ccagagcgca tgggagttct gcctggtccg 1440 
cgagaacagt tcaagggcag acggggtttt tgaggaggat tcgcaaattg acatagccac 1500 
agtacaggat atgcttagca gccaccatta caagtcattc aaagtcagca tgatccacag 1560 
actgcgattc acaaccgacg tacagctagg tatctctgga gacaaagtag agatagaccc 1620 
tgttacgaat cagaaagcca gcactaagtt ttggattaag cagaaaccca tctcaatcga 1680 
ttccgacctg ctctgtgcct gtgaccttgc tgaagagaaa agccccagtc acgcaatatt 1740 
taaactcacg tatctaagca atcacgacta taaacacctc tactttgaat cggacgctgc 1800 
taccgtcaat gaaattgtgc tcaaggttaa ctacatcctg gaatcgcgag ctagcactgc 1860 
ccgggctgac tactttgctc aaaaacaaag aaaactgaac agacgtacga gcttcagctt 1920 
ccagaaggag aagaaatccg ggcagcagtg acactggcct ccagcctcaa tctgttccgt 1980 
agctcagagc ctgcctgcca gggccaagtg ccctagagcc cacccggtgt cctgaagtcc 2040 
tcggggggag gccagcccct ggctcactgg cacagggcag gtgggctctc ggggaaggtg 2100 
tcgggggccc cctaggaggg agcgctgggg acattgccat gggacggaag tctgcttggc 2160 
agtggctttg ataagcgatg cttgggggtc agaccacccc ctagaggagc cacgtgccgc 2220 
ccagccacct tcaatgcctg ccaccctgcc cgaggatgta cagagccgtg cccacacatt 2280 
tccttgcaac ttgatcaaat ttcttaaagc aaacaacaaa aatgtacatt tctgtttttc 2340 
cttttaataa acaggtgtac tctttatcat ggttggtatg atggaccatt ctttggggcg 2400 
gaggattgat tatgttactc tctttaaaat ctgttcccat attgaacagg cagattggaa 2460 
aagctatggt tcgatttctc agaagaaatg tttaggtctt agtcaatagt tttaactatg 2520 
ccatttgttt aaatgagtgc atttgcttcg agggtagtgt cttactaaaa gttaggaaca 2580 
gagacctagt ggtgtgtcca aggccgtgtc actttcccct tcagcacacc ccagcttctg 2640 
acctcagagc ccaggagctg cgtggacagt gtggggtgcc aggaggaggg gcggtggctg 2700 
gtcctcaggc acgctgcact cccagccaga catggtcttt ccgtttctta agtagcaagt 2760 
gtaggtttca gctggcagtt ccacctgcat gttctctgct tcgctgcctt ggaaggggcc 2820 
acattcccca ttcctcttct ccttacagcg cctgcctcct ttttcaagca ggcggaaagc 2880 
tgctgtttct cacgtttcag ggagaggggt gagcggaggg agacctgtgt ccgtgccgtc 2940 
cggctccctg ggtgggaaca ggcaagggat cagatgcccc tgacaccacg cctctggcca 3000 
caccagatgc ctctgcagtc ctcgacagcc tcttcagtgt ccctcctgcg gtgatgtcct 3060 
tactgtcccc agccagggcc ggggaccggt gtttcactga ggacctgcat tagaaacatt 3120 
ttttaaattg ttgtacagga agagatgtgt ctaaaacagc atcttaaagc tgagtgtatt 3180 
tctttgcaca aggggtcatg ctgatgaatt cttctttcat tctgatcttt gttcagccaa 3240 
caggagcgtc cttttctaat gtcttccatt cctacccccc acccaaaaac aaaagaaata 3300 
tttgtagctt gctatctgta tttgaatttt tagcaatttt atatttagat actttgaaaa 3360 
atgtaaatga ctaatttggt cattaaatct tgtgacatat tcgatattaa aatgatatta 3420 
aaataaaagt catataaata cacaaaaaaa aaa 3453 

<210> 114 
<211> 2663 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1976780CB1 

<400> 114 

gaaaaaggtt cgaaagaact ggttgtcttc ttgggcggtg ttgcagggtt catctttact 60 
ttttaccaaa actcaaggaa gtagcacaag ttggtttggc agtaatcagt ccaaaccaga 120 
gttcacagtg gacctcaagg gggcaacaat tgagatggct tcaaaggata aatccagcaa 180 
aaagaatgta tttgagctga aaactcgtca aggaacagaa ctgctaattc agtctgacaa 240 
tgacactgtt attaatgatt ggtttaaagt tcttagtagt acaatcaata atcaggcagt 300 
agaaactgat gaaggaattg aagaggagat accggattca ccaggaatag aaaagcatga 360 
taaagaaaag gaacaaaagg atcccaaaaa gcttcgttcc tttaaagtat ctagcataga 420 
ttcttcagaa cagaaaaaaa ccaagaaaaa cttaaagaag tttcttacac gacgccccac 480 
tttgcaagct gttcgtgaaa aaggttatat taaagatcag gtatttggat ccaatctcgc 540 
taatctgtgt cagagagaga atggcacagt accaaagttt gtgaagttat gtattgaaca 600 
tgttgaagaa catggtttgg atattgatgg gatatacaga gtaagtggca acctcgcagt 660 
gatccagaaa ctaaggtttg cagtcaatca tgatgagaaa ttggacttga atgacagtaa 720 
atgggaagat attcatgtca ttactggagc cctcaaaatg ttttttcgag aattaccaga 780 
acctcttttt acatttaatc attttaatga ttttgttaat gcaattaagc aagaaccaag 840 
acagcgagtc gctgctgtta aggacctaat cagacagttg ccaaagccaa accaagacac 900 
aatgcagatt cttttccgac atctcagaag agttatagaa aatggagaga aaaatcgaat 960 
gacctatcag agtatagcaa ttgtttttgg tcccactcta ttaaaaccag aaaaagagac 1020 
tggtaatata gcagttcata ctgtgtacca gaatcagatt gtagaattaa ttcttctgga 1080 
actgagttcc atcttcggac gttgattctt actgaagaca acctgtggaa tagaagctgg 1140 
attccatcag atttcaaatg tttatacaca atgtatttta ttttttggac caagcagtga 1200 
ctctttgatt ttgcactttt tttttgaggg atcagaaggg aaggggagag tcgagatgtg 1260 
tgttaggccc tcatatttgc tgctttgttg caagttgata taactgcgtg taattatgaa 1320 
ttcattttat cctgaatgtt tgcatttcat actctgaatt tcagtaaaaa tcaaaactta 1380 
aaattctaac cagtcatata cactggataa tttggtaaga aaactgtatt ttttttccct 1440 
gaaattggat aatgtacttt cttctcaaga ttcatgactt gatagaacaa tactttcagt 1500 
tatgttgcaa aggctcttgg gcattttaaa caaaatgaag tatatccatt ttgaaacctg 1560 
tgtatttctt tttcggggtt tctgcatgca gtggcagtct taagtgccaa aattcattat 1620 
aaccccaaaa taaccccttg atgaaggctt gctgtctttt actgtgttac acagcatcct 1680 
tactggatat cttagttgct tgtttgggca gcacactaat attacttaaa acactgtgat 1740 
atactggagt tttagttagc ggaagtcagt tcagggcatt ttagggctgt cttgctatac 1800 
tgaattgtag ctaacaatcc taattatatc tagtaccata ctgagttatt ggtatgaccc 1860 
tgtggaaaca cacattattt tatgtaaata taggctaaag acttaatgtc ctttagcttg 1920 
tgtatataat tgtgttgtat agtctcagag tacattctaa ccctacattt ctaatcattg 1980 
ttattggtaa tcttttctgt gaatattagg tttcctccag aaatggtccg ttatttggga 2040 
aagttaactg tgtgcacttt tagatattaa ctacatttac aggcaaatca ctgtaatgag 2100 
aatggtactg gaaaaatact gaatagactt gctaaatggc acatgcacta caagaggaac 2160 
cttttgggtt atttaatatg tacagaaaac attagaaaaa atttattaca gaattctaat 2220 
tccagtatga atagtggaaa cccatctgta aattagatgg atgttggatg gaaaatgaca 2280 
ttgctaaatt tgagaatttc tttttaccta ctaatgtaga ttgctttgta taataaaaca 2340 
cagggtttgg aaggttttgt tacagggagc atggtctgtt gaagattttt aaaatgtatt 2400 
tttctagatt aacttctgta catgaaatgt ctaataaaac tataagaggt ttagagattt 2460 
ttccattgga aatgtgcatt ttggtttcta atttttttgt tttttcattt actggcatac 2520 
tgttatacct catttttaaa aatcaactga atccaatatt tcctgtggca aataacactt 2580 
tcctcatttc ataccttttc tcctctcttc catgccaaca tttctccacc cacaacgtac 2640 
actgtttatt tctcatcaat att 2663 



<210> 115 

<211> 1218 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2048234CB1 



<400> 115 

gcctgttgca gccatggtgc attgcagttg cgtgttgttc agaaagtatg gaaatttcat 60 
cgataagcta agactcttca ccaggggagg atccggtgga atgggttatc ctcgtttagg 120 
tggagaaggt ggaaaaggtg gtgatgtctg ggttgtagcc cagaacagaa tgactttaaa 180 
acaacttaaa gacaggtatc ctcggaaacg gtttgtggct ggagtaggag caaacagcaa 240 
aattagtgca ctgaaaggct ccaaaggaaa agactgggaa atccctgtgc ctgtgggtat 300 
ttcagtaact gatgaaaatg gtaaaattat aggagaactc agtaaagaaa atgacagaat 360 
tttggtagct caaggaggtc ttggtggtaa attacttaca aatttcttac cattgaaagg 420 
ccagaaacga ataattcacc ttgatctaaa acttatagct gatgtaggcc tagtaggatt 480 
cccaaatgct ggaaaatcct ctttgctaag ttgtgtttct catgcaaaac ctgcaattgc 540 
agattacgca tttacaacat taaagctgaa gctcggaaag ataatgtaca gtgatttcaa 600 
acagatatca gtagctgatc ttccgggttt aatagaagga gcacatatga acaaaggaat 660 
gggccacaaa ttcctcaagc atatagaaag aactagacaa ctactttttg ttgttgatat 720 
ttctggattt cagctttctt ctcacactca atacaggaca gcttttgaaa ccataatact 780 
gcttacaaaa gagttggaat tgtacaaaga ggaacttcag acaaaacctg cactcttggc 840 
agttaataaa atggacttgc cagatgccca agataagttc catgaattga tgagccagct 900 
ccagaatcct aaagattttc tgcatttatt tgaaaaaaac atgattccag agaggactgt 960 
agagttccaa catatcatcc ccatatctgc agttactgga gaaggaatcg aagaattaaa 1020 
gaattgtata agaaagtcac tggatgaaca ggccaaccag gaaaatgatg cacttcataa 1080 
gaaacagttg cttaatttgt ggatttctga tacaatgtct tctactgagc caccatcaaa 1140 
gcatgctgtt actacttcca aaatggatat aatttaaata tattaaaaat ggtattgatg 1200 
gaacagtaaa aaaaaaaa 1218 

<210> 116 

<211> 1286 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2111754CB1 

<400> 116 

cccgccttga acttctggac tagcccctcg attgttgtag atgccaagcg gacctcgcgc 60 
cgctctgcgt tgggccagcc cctcacagct ggtttcttac cacgtattgc gcaacggaat 120 
ctatgcctgt tacccacact ccctgcgccc ccgcaccccg ctcctgtgcg caagtcggaa 180 
tataaaaccg cggaggagtg agctcttggg gtgtccagtt ggttgccgcg gcagtctctc 240 
cgagcagcgc atttgtcttc taggctgctt ggttcgtgcc tccgagaaag gggtctcctg 300 
ctgccagcta agtgtgggag aacttgtgca cgtatctccc ctccgaatcc caacgatggg 360 
taacgccagc tttggctcca aggaacagaa gctgctgaag cggttgcggc ttctgcccgc 420 
cctgcttatc ctccgcgcct tcaagcccca caggaagatc agagattacc gcgtcgtggt 480 
agtcggcacc gctggtgtgg ggaaaagtac gctgctgcac aagtgggcga gcggcaactt 540 
ccgtcatgag tacctgccga ccattgaaaa tacctactgc cagttgctgg gctgcagcca 600 
cggtgtgctt tccctgcaca tcaccgacag caagagtggc gacggcaacc gcgctctgca 660 
gcgccacgtt atagcccggg gccacgcctt cgtcctggtc tactcagtca ccaagaagga 720 
aaccctggaa gagctgaagg ccttctatga gctgatctgc aagatcaaag gtaacaacct 780 
gcataagttc cccatcgtgc tggtgggcaa taaaagtgat gacacccacc gggaggtggc 840 
cctgaatgat ggtgccacct gtgcgatgga gtggaattgc gccttcatgg agatttcagc 900 
caagaccgat gtgaatgtgc aggagctgtt ccacatgctg ctgaattaca agaaaaagcc 960 
caccaccggc ctccaggagc ccgagaagaa atcccagatg cccaacacca ctgagaagct 1020 
gcttgacaag tgcataatca tgtgagccct gggccttaag agccagctct tcctatcctg 1080 
tagcgtgtag aaaacgtgga ctcatttcac tatgttacat gtacatggtt gattttgtgc 1140 
tgttgtttgg actgtaacat ccatgttgtc aatacgtata ccttgtaagt ggataacttt 1200 
tctttttccc aggccagaga attcaaattg ttaaaacatt ggcatttgaa gaggagaaca 1260 
aaatgtagca tgatgtattt aaagta 1286 

<210> 117 

<211> 3057 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2123286CB1 

<400> 117 

caaggctccg gcctgcgagg agtcacatta actttgctct agaagacaac tttacaagga 60 
tctaaaagga acaggattaa agatgactga atactgggtt ccagaaattt aaaacaatca 120 
gcttagcaaa tcatatattc ttctgtggag ctgagaattg atgtccgctc ttccccgtga 180 
tttggaactt tccaatccca gagaaaagtt gacaaaggga ctgcccagga ctgagtccat 240 
atggaagaag aacttcctct tttctctgga gacagtggca agccagtaca ggctactctg 300 
tcatctttga agatgttaga tgtgggaaag tggccaattt tttccctttg ttctgaagaa 360 
gaactacagt taattcgtca ggcttgtgtc tttggcagtg ctggcaatga agttttatac 420 
actacagtaa atgatgagat ttttgtgctt ggcacaaact gctgtggctg tttggggtta 480 
ggtgacgtcc agagcaccat tgaacctcgg agactggatt ctttaaatgg caaaaaaata 540 
gcctgcctca gctatgggag tggtccacat attgtccttg caacaacaga aggagaagtc 600 
tttacctggg gtcataatgc ttatagccag ctgggcaatg ggacaactaa tcatggttta 660 
gtgccctgtc atatctctac taatctgtca aacaaacaag tcattgaagt tgcctgtggg 720 
tcttaccatt ctttggtgct aacatctgat ggagaggtat ttgcctgggg ttataataac 780 
tctgggcagg taggatctgg atcaacagtt aatcagccaa tccctcgaag agtcactggc 840 
tgcctacaaa ataaagtagt tgtgaccata gcatgtgggc agatgtgctg catggcagta 900 
gtagacacgg gggaggtcta tgtctggggt tacaacggaa acgggcagct tggactcggc 960 
aacagtggca accagccaac cccttgcaga gtggcagctt tgcaaggcat ccgtgtccag 1020 
agggtcgcct gtggctacgc acacacatta gtattaacag atgaaggcca agtgtatgct 1080 
tggggcgcca attcttatgg gcagttgggc actggcaata aaagcaacca gtcctatcct 1140 
actcctgtca ctgtggaaaa ggacaggatt atcgagattg cagcctgtca ctccacacac 1200 
acgtctgcgg ccaagacgca gggtgggcac gtgtacatgt ggggccagtg ccggggtcag 1260 
tccgtgatcc tcccgcacct cacccacttc tcctgcactg acgacgtgtt tgcctgcttt 1320 
gccacgcccg ccgtcacgtg gcgcctcctc tccgtggaac ctgatgacca cctcacagtg 1380 
gctgagtcac tgaagaggga atttgacaac ccggacactg cagacctgaa gtttctagtt 1440 
gatggaaagt acatttatgc acataaagtc cttctcaaga ttcgatgtga gcattttcgt 1500 
tcgtcattgg aagataacga ggatgatatt gtagaaatga gtgaattttc atatcctgtt 1560 
taccgggcct tcctggaata cctatacaca gacagcatca gcctttctcc tgaggaggca 1620 
gtaggactgc tagacttggc tacattttat agagaaaatc gtttgaaaaa gctctgccaa 1680 
caaactatca agcaaggcat ctgcgaggag aatgccatcg ctctgctctc ggctgcggtg 1740 
aagtatgatg cacaggattt agaagaattc tgcttcaggt tttgcataaa ccatctgact 1800 
gtagtaacac aaacatcagg ttttgcagaa atggaccatg atctcctgaa gaactttatc 1860 
agcaaagcaa gcagagttgg agcctttaaa aattgatccc atctgcagga aagtttttga 1920 
gcctttccat ttcccctgca aaagccagag atgaatcact tctctttaat taatagtatg 1980 
tatgatgagc tatgtttggc tgagtacttg taactgtcag aagaaggatg gtggtgagtg 2040 
gtctttgtct gcctaaaccc agagtttatg tagaaagcat tgaatgttct gatcagatgt 2100 
gactaaggtc aaggaaaaaa aattgaaata tcttatttac catttcctct ttttgagtca 2160 
cttaaattgg acacctttgg taccctggtc tcagtatatg ctattctggc ccaaatgttc 2220 
cattattcag ctggctgata ccacatagat agcttgacaa ggagtgctgt ctgtccttac 2280 
cacattttca gcactcagca cagtgccttg tgtataatag gcactcaatt tattataaat 2340 
cttcagtatg tcctgagaac agctttagtc atggaatact gggagaagga ataactttca 2400 
caaaataaac ttaaaacagc ctgtaattat tgaggttcat attcttctgg tatatcattc 2460 
tgagaaattg tggctaattt agaacattgt ttagaattga caaaaggccc tggcaattaa 2520 
attgtcaagg cccaagggct aattttaatt ttctttttac ttggagtcat tcattaattt 2580 
ctcacatggg attatggagt atgaagtatt atctttgaat gaaattcctg ggctgatctg 2640 
ccttacataa tcacataagg tcctttgctt ttctttgtgt taagagggac ttgcctctgt 2700 
aaatgaaaat gacaatgtgc ttttcttgta gttgactttc atgtcactca ctataaaata 2760 
ggtctcttaa cctggcacca gtataactat aaagcactag ctgagaagga actgatactt 2820 
acatttcatg gacagcatta acaagaatga gataaatttg tactttttag atcaaaacaa 2880 
attacctaat tgcaaaagag aaactgaaat ggaacatagt ctcagattct tctaatgtgt 2940 
atctcacaat gtcatgtaat gtaaaggaaa cccttttgga attagaattc ttgttctgat 3000 
gctgaactat ttggtaataa agtgcttatt tgcagataac agaaaaaaaa aaaaaaa 3057 

<210> 118 

<211> 1803 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2477507CB1 

<400> 118 

ggtcgcggcc ttcacgggtt cctcggccgt cctgagtccc aaatcccaag ctggagccgt 60 
tcagctcccc tccaccgctt agagatttgg ggggctctgg ccccgtcctt gcggaccatt 120 
ccgaggggag tccagaggtg aggccgagga acctccctga ctttgcgggg cgcgccgctc 180 
ctgcgtctcc tgccagtctc ccttccttct tttcggtcaa caattgaaaa caaaacgagg 240 
aacagcagag gagctactgt ataccgagcc ctcagcattg ttcgtaatct ccgcctgcta 300 
acagccttgt gaagaaggtg ctattcttct caacacttta cagatgagga cacttgaggt 360 
tcggagacgt ggagcctctt gcacagctgc ttaagtggtg gtagagccga gatttgaacc 420 
ctcctaacca ttcctttctg ccgcctactg cagctcccag cagagatgat tgaactgttg 480 
ctcggggtag ggcccccagg tgtcagtaat taacactgtg gatacctccc atgaggacat 540 
gattcacgac gcccagatgg actactatgg cacccgcctg gcaacctgct catcagacag 600 
gtccgtcaaa atctttgatg tgcgcaatgg agggcagatc cttatcgccg acctcagggg 660 
tcatgagggt cctgtgtggc aagtggcctg ggctcacccc atgtacggca acatcctggc 720 
atcgtgctcc tatgaccgga aagtcattat ctggagagag gaaaacggca cctgggagaa 780 
gagccacgag catgcgggac acgactcctc agtgaactcg gtgtgctggg ccccccatga 840 
ctacggcctg atcctggcct gtgggagctc ggatggggcc atctccctgc tgacttacac 900 
cggggaaggc caatgggaag taaagaagat caacaacgct cacaccattg gctgcaatgc 960 
cgtcagctgg gcccctgctg ttgtacctgg aagcctcata gaccacccat cggggcagaa 1020 
acccaattac atcaagaggt ttgcatcagg tggctgtgac aacctcatca agctgtggaa 1080 
ggaggaggag gacggccagt ggaaggagga gcagaagcta gaagcgcaca gtgactgggt 1140 
tcgagatgtg gcctgggccc cctccatcgg cctgcccacc agcaccatcg ccagctgctc 1200 
ccaggatggt cgtgtgttca tttggacctg tgatgatgcc tcaagcaata cgtggtcccc 1260 
taaattgttg cacaagttca acgatgtggt gtggcatgtg agctggtcca tcacagccaa 1320 
catcctggct gtctctggtg gagacaataa ggtgaccctg tggaaggagt cagttgatgg 1380 
gcagtgggtg tgcatcagtg atgtcaacaa gggccagggc tccgtatcag catcagtgac 1440 
agagggccag cagaacgagc agtgacaaga caggtggggc ctggctcccc acccgccagc 1500 
tccaggactg ccccttcctg ggccaactaa ccagacaact gggaagagcc cccaactcca 1560 
acaggattat tttcccagga ggagttacag atgcagccac agattgatca tctgccttaa 1620
cgtgatcgga gatgctttgt aatctactgt ccagctgaaa gcactcatgt tacgaggaag 1680
aaactacaag tgatgttcaa atctattttg ggtcattttt atgtaccttt gggttcaggc 1740 
attatttggg gggttttgtt tccaaaggaa ctaaataaag tcatattgct tataaaaaaa 1800 
aaa 1803 

<210> 119 

<211> 4407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2759119CB1 
<220>

<221> unsure 

<222> 4373, 4379 

<223> a, t, c, g, or other 

<400> 119 

ggtcgtcatt ggacaaccgc cgcggggccc 
gttgatgggc agcgcggcgc gctgggaagg 
atacccgtgg ctgcctcagc acctcgagga 
ggccgcgggc aacctcggag cttatactcc 
accggctgtt ggcgggtgag ggtcccgatg 
atctgcggat gataaagcga gtgcagaacc 
gggtacggcc agagcctaat ggagaccttg 
gcaagggact ccgagttgtg aaaattagct 
gctctggcct gtggaacatg tctgactgga 
tagccttggc cctgggccac aactcagtgg 
tgcaagaggt gccctgcaca gacaggtgca 
cctggaagga gctgaccata gtggcaggtg 
cagcaactgc cttagcagac aacaaacctg 
tgggcatcat cttcagcatg tcatacctgg 
aagaccgaag cgttcgtatc tggaaggtgg 
agaatattgg gcactgcttt gggcacagcg 
attaccttat cagtgcagga gaggattgtg 
tcctccaggc ctttcgggga caccagggac 
ggcaggcctg ggtgatcact gggggtgatg 
ggcgtgggta ccggggattg ggggtctcgg 
gtacactcaa ggctgtgact ctggctggct 
gggccctgta tctctatgac gtcgaggtca 
atttccagtc ctactgcctg ctggaggcag 
ctatggccaa tggggaaggt cgtgtcaagg 
tggaccagac cctgtttcct gggaaggtgc 
aggagctcct gttgctggca tcgggccctg 
ccgcaccctc tggcaaggcc atctttgtca 
gcaagcagag atggcacaca tgcagtgcct 
gtgaccgccg gggctctgtg ctgctattcc 
gggtgggagg caaggctcgg gctggtgctg 
ggggtgggaa tgctttcact gggttgggcc 
agcagggtgt gacctcagtc acatgccatg 
gagcctacta ccagctgttt gtacgagacg 
cctgtcgagg catgaactgg ctagctgggc 
tcctgggttt ccatgccaat gagtttgtgg 
acatcgtcaa ctgtggtgga gggcaccgtt 
tggcctttgc ttacctcaag gatggggatg 
cccggccaca cgtgattctc cgggagggtc 
gtgtgggcac cattaccctg gggcctgaat 
acctggagcc tggcagtgag gggcccgact 
acactactgt ctgtgtccta gcactcccta
ctgtttgtaa ccatatctcc tcggtacgtg 
gtggccctca ggatcctcag ccaggcctga 
ctgagatgca ctgcttcagc atcatggtta 
cctgccatgt catgcacctt tcgtcccacc 
atcggcatcg gatggttaag gtagacccag 
aacttgacca gcccggcctt ggcccccttg 
gctctttctt ttgcaggatt ctgggcggat 
taagcgatgt gtcctcaagg tccactcctt 
gctcctcctg tgcagcgcag ctactgatgg 
gctagaccat gactccactg tcctggagcc 
tggcaccccc tccctgactc tccaggccca 
gcccacccgt gagggccacc atctcgtggc 
cttcgtgctt gctgtggaga tgctacagct 
accccagctg cgtgtgctag aggaatactc 
aggcctcaag atcctaagcc caagcatcat 
cttctggcgt ctggggcatg gtgaacccac 



tggtctctgc tacctgtagc tgagggtgct 60 
ctcgttctcg cgagagttca gctcccttct 120 
tcgacatgga cgctctcgag gactacgttt 180 
tcccagtgac gggtctggag tgcgtggggg 240 
tcctggtgta cagcttggac tttggtgggc 300 
tgcttggcca ctatcttatc catggcttcc 360
acttggaggc catggtggct gtgtttggaa 420
ggggacaggg ccacttctgg gagctttggc 480 
tttgggatgc acgctggctt gagggaaata 540 
tgctatatga ccctgtagta gggtgcatcc 600 
ccctctcttc agcctgcctg attggagacg 660 
ctgtttccaa ccagctcttg gtctggtacc 720 
tagcacctga ccgacgaatc agtgggcatg 780 
aaagcaaggg attgctggct acagcttcag 840 
gcgacctgcg agtgcctggg ggtcgggtgc 900 
cccgtgtgtg gcaggtcaag cttctagaga 960 
tctgcttggt gtggagccat gaaggtgaga 1020 
gtgggatccg ggccatagct gcccatgaga 1080 
actcaggcat tcggctgtgg cacttggtag 1140 
ctctctgctt caagtcccgt agtaggccag 1200 
cttggcgact gctggcagtg actgatacag 1260 
agtgctggga gcagctgcta gaggataaac 1320 
ctcctggtcc cgagggcttc ggattgtgtg 1380 
ttgtccccat caacactcca actgctgctg 1440 
acagcttgag ctgggccctg cgtggttatg 1500 
gcggggtagt agcttgccta gagatctcag 1560
aggaacgttg tcggtacctg ctgcccccaa 1620 
tcctaccccc aggtgacttc ctggtgtgtg 1680 
cctccagacc aggtctgctc aaggaccctg 1740 
gggcacctgt agtgggtagt ggtagtagtg 1800 
cagtgtctac cctgccctct ctgcacggga 1860 
gtggctatgt gtataccata gggcgtgatg 1920 
gccagctcca gccagtccta aggcagaagt 1980 
tccgtatagt gcccgatggg agcatggtta 2040 
tgtggaaccc tcggtcacac gagaagctgc 2100 
cgtgggcatt ctctgatact gaggcggcca 2160 
tcatgctgta cagggctctg ggtggctgca 2220 
tgcatggccg tgagatcact tgtgtaaagc 2280 
atggagtgcc cagcttcatg cagcctgatg 2340 
tgactgacat tgtgatcaca tgtagtgagg 2400 
caaccacagg ctcagcccac gcactcacag 2460 
ctgtggctgt gtggggcatt ggcaccccag 2520 
ctgcccatgt ggtgtctgcg ggggggcggg 2580 
ctccggaccc cagcacccca agccgcctcg 2640 
ggctagatga gtattgggac cggcaacgca 2700 
agaccaggta catgtccctt gctgtgtgtg 2760 
tggctgcagc ctgtagtgat ggggccgtaa 2820 
tctgcagctc cttgctgaaa ccttccacca 2880 
tacacacgag gcacccaacc agaggcggag 2940 
cagcctggct ttctgggatc tcaccaccat 3000
tccagtggat cctgggcttc cctaccggct 3060
cagctgtggt atcaacagcc tgcacacctt 3120
cagtggcagt gaagatggat ccctccatgt 3180 
agaagaggct gtgggagagg ctgggctggt 3240 
tgtcccctgt gcacatgctg cccatgtgac 3300 
ggtctcagcc tccattgatc aacggctgac 3360 
cttcatgaat agcactgtgt tccatgtgcc 3420 
tgatgtggct gacatggact gctggcctgt
tgggggtcag gggcttgagg tttacaactg 
gtgctgggca tggggcctgc tcacagacag 
tgctcagcat gccttgaggg gaggaggtgg 
gctgaaggtg agtggagtgc tgccaagaat 
ttgtaacaaa cagtaccaat ttattttggc 
actttgtgaa cattcccagg tattggagcc 
gagacccagc atagccaggc cagtatggag 
gcggacgaac atctgaccaa agaggtgtgg 
ggtctcatpc tctgcttcct ttgcctttac 
ttgctgtgtg ctcaccccca ggatgtgtac 
gagctgtacc aggaacttgc atatctagag 
gctcttgtgc cccaggccag aataaagaat 
tcaccatctc tgtgcgtaca gcaatgtgga 
ccggctcagc agatccggaa agggaggata 
ttggggagaa aagtgcttgg aaggggaatc 
aatcactcat catgtagcag tgagaag 



gagccctgag tttggccacc gttgtgccct 3480 
gtatgactga ggtatcctgc ggtggctggc 3540 
catggagcag ggaagggctg tctgtgccca 3600 
tggccgtggg ttcctgatgt cggtgcagga 3660 
atgcccgact ccccatgaca agacagaact 3720 
cgtgggtttt tgcttttttt ccagttgatg 3780 
tctgtggcct taaatgtggc tcagtggagg 3840
cacctcacgc acagctctca gaagctgcag 3900 
tcgaggctcc tgaaagagaa agggcctgct 3960 
cctatacctc tctgcacgtc ccaccccgtt 4020 
ccggttgtag taggagctga aatccatgct 4080
acagagactg agtcactggc ccatctcttt 4140 
agagtgtaga gtgtcctggt tgtctatgcc 4200 
ccccggggct gtgcagtcca gcactgctgt 4260 
ctgttgaaga gcaacaacca ctcaccctgt 4320 
caggctcctt gtgccagtaa cangagggnc 4380

4407 



<210> 120 
<211> 959 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2823818CB1 



<400> 120 

cccacgcgtc cgcccacgcg tccgggagcg tggagcgccg ggactgtgca cgcttgaccg 60
gaagcccaga ccagtgcggt cctagccaga gagaaaggac atttgccaac aatgagacac 120
gaagcgccca tgcagatggc ctctgcccaa gatgccaggt acggccagaa agactcctct 180
gatcagaact ttgactacat gttcaaatta ctcatcatcg gcaatagcag tgtggggaaa 240
acatcttttc tattccgtta tgcagatgac tcctttacat ctgcattcgt cagcacagtt 300
gggatcgatt tcaaagtaaa aactgtattc aaaaatgtaa agagaatcaa gcttcagatt 360
tgggacacag caggccagga aagatacagg actatcacca cagcctatta tcgtggagcc 420
atgggcttta ttttaatgta tgacattaca aatgaagaat ccttcaatgc agtacaagat 480
tggtcaactc aaatcaaaac atactcttgg gacaatgccc aagttattct ggttgggaac 540
aagtgtgaca tggaagacga gcgggtcatc tcaactgagc gaggtcaaca tttaggagaa 600
cagcttgggt ttgagttttt tgaaacaagt gccaaggaca acattaatgt caagcagaca 660
tttgagcgcc ttgtggatat catctgcgac aaaatgtcag agagtttgga gactgatcct 720
gccatcactg ctgcaaagca gaacacgaga ctcaaggaaa ctcctcctcc accgcagccc 780
aactgtgcct gctagtgtcc ccgtgcacac aggcagctcc agggggctct ggttgccaac 840
aaacagcatt tgtaaatggt ctattagcct tcatttatac tgcctaacaa ttatttgaag 900
gaataaattg atgtcaatgg ctcgtaaaaa aaaaaaaaaa aagtaaaaaa aaaaaaaaa 959

<210> 121

<211> 1809

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 2859730CB1 



<400> 121 

ggcagcggtt ggaggcttcg cccggctttg 
cacctcggcc cggacacgat gaggcgagtg 
gggcaagcgg tgaaaaatga ccagtgctat 
gatagttcct tttgtgctgt caatcccaga 
ggaggagcgt tccttgtcct ccctctgcgc 
acagtatgtg gccacacagg accagtgctg 
gtcattgcca gcggttcaga ggactgcacg 
ctcacccttt ccctgactga acctgtggtg 



cagcggggac ttcggcggcg gcgcctcagg 60 
gtacgacaga gcaagtttcg gcatgtattt 120 
gatgacatcc gggtttctcg tgtgacctgg 180 
tttgttgcca taatcataga ggcaagtggg 240 
aagactggtc gaattgacaa atcttaccct 300 
gacatagact ggtgcccaca taacgatcag 360 
gtcatggtat ggcagatccc agaaaatgga 420 
attttggaag gccactcaaa gagagtcggc 480 
atcgtggctt ggcatccaac ggcccgcaat
attatcatct ggaatgtggg aacaggggaa 
gacatgattt acaatgtgag ctggaaccgg 
gacaagaaag tgagagtcat tgatcccagg 
gcacatgaag gagcaagacc catgagagcc 
actgggttca gccgcatgag cgagcggcag 
gaaccaattg ctcttcatga gatggacact 
cctgacacca gcatcattta cttatgtgga 
atcacggatg aatccccgta cgtccactac 
agagggatgg gttacatgcc caagagggga 
ttcttcaaac ttcatgagag aaagtgtgaa 
gaccttttcc aagatgacct gtatcctgac 
gaagagtggt tcgaaggcaa gaatgcagac 
attccaggca aaaacaggga tctcaaggtg 
actgcaaaca agaagtgcga cctgatcagc 
gtgcaaaatg aagccaagtt ggatgagatt 
atctgcaatc aagatgagcg tatttccaag 
tgaaggtccc acccccaccc ctacagaaaa 
ggttattggt gtggtcctag ggagggcgga 
atttcagatt tgtcaaccag cgataggcca 
aatttgcaga aacaaaacgt gatttaaaag 
gttttaagtg ttatgtgact tgttgaactt 
tactctgaa 



gtgcttctta gtgcaggctg tgataatgcc 540 
gcccttataa acttggacga tatgcattca 600
aatggcagtc tgatctgcac agcttccaaa 660 
aaacaagaga ttgttgctga gaaggagaaa 720 
atcttcctgg ccgatggcaa tgtcttcacc 780 
ctggctctct ggaatccgaa aaatatgcag 840 
agcaatgggg tgttgctgcc tttctatgac 900 
aagggtgaca gcagtattcg ctattttgag 960 
ctcaacacat tcagcagcaa ggagcctcag 1020 
cttgatgtta acaaatgtga gattgccaga 1080 
cctattatta tgactgttcc caggaagtct 1140 
acagcggggc cagaggccgc gctggaggca 1200 
ccaatcctca tctccttgaa gcacgggtac 1260 
gtcaagaaga acattctgga tagcaagccc 1320 
atccccaaga aaaccacaga cacggccagt 1380
ttaaaagaga tcaaatctat aaaagacaca 1440 
ttagaacagc agatggcaaa gatagcagcc 1500 
aatgggagca agaacttgtg cttgggagct 1560 
aagggaggca ctgccatttg gagacattcc 1620 
cattccagta agaactcaat ttgtctccca 1680 
ctgagctttt tatcagaaag cttttttgat 1740 
tttaaaaagt gctactttta aaatcccaga 1800 

1809 



<210> 122 
<211> 2028 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2861155CB1 

<220> 

<221> unsure 

<222> 1943, 2003 

<223> a, t, c, g, or other 



<400> 122 

tggcgggttc cgtgggtcgc ccgcgaaatc tgatccggga tgcggcggcc caatcggaag 60
gtggaccgaa atcccgcgac agcaagaggc ccgtagcgac ccgcggtgct aaggaacaca 120
gtgctttcaa aagaattggc gtccgctgtt cgcctctcct cccgggagtc ttctgcctac 180
tcccagaaga ggagggaagc acaggtgggt ttctttagct ctgcgtcgga tccctgagaa 240
cttcgaagcc atcctggctg aggctaatct ccgctgtgct tcctctgcag tatgaagact 300
ttggagactc aaccgttagc tccggactgc tgtccttcag accaggaccc agctccagcc 360
catccttctc cccacgcttc cccgatgaat aaaaatgcgg actctgaact gatgccaccg 420
cctcccgaaa ggggggatcc gccccggttg tccccagatc ctgtggctgg ctcagctgtg 480
tcccaggagc tacgggaggg ggacccagtt tctctctcca ctcccctgga aacagagttt 540
ggttccccta gtgagttgag tcctcgaatc gaggagcaag aactttctga aaatacaagc 600
cttcctgcag aagaagcaaa cgggagcctt tctgaagaag aagcgaacgg gccagagttg 660
gggtctggaa aagccatgga agatacctct ggggaacccg ctgcagagga cgagggagac 720
accgcttgga actacagctt ctcccagctg cctcgatttc tcagtggttc ctggtcagag 780
ttcagcaccc aacctgagaa cttcttgaaa ggctgtaagt gggctcctga cggttcctgc 840
atcttgacca atagtgctga taacatcttg cgaatttata acctgccccc agagctgtac 900
catgaggggg agcaggtgga atatgcagaa atggtccctg tccttcgaat ggtggaaggt 960
gataccatct atgattactg ctggtattct ctgatgtcct cagcccagcc agacacctcc 1020
tacgtggcca gcagcagccg ggagaacccg attcatatct gggacgcatt cactggagag 1080
ctccgggctt cctttcgcgc ctacaaccac ctggatgagc tgacggcagc ccattcgctc 1140
tgcttctccc cggatggctc ccagctcttc tgtggcttca accggactgt gcgtgttttt 1200
tccacggccc ggcctggccg agactgcgag gtccgagcca catttgcaaa aaagcagggc 1260
cagagcggca tcatctcctg catagccttc agcccagccc agcccctcta tgcctgtggc 1320
tcctacggcc gctccctggg tctgtatgcc tgggatgatg gctcccctct cgccttgctg 1380
ggagggcacc aagggggcat cacccacctc tgctttcatc ccgatggcaa ccgcttcttc 1440
tcaggagccc gcaaggatgc tgagctcctg tgctgggatc tccggcagtc tggttaccca 1500
ctgtggtccc tgggtcgaga ggtgaccacc aatcagcgca tctacttcga tctggacccg 1560
accgggcagt tcctagtgag tggcagcacg agcggggctg tctctgtgtg ggacacggac 1620
gggcctggca atgatgggaa gccggagccc gtgttgagtt ttctgcccca gaaggactgc 1680
accaatggcg tgagcctgca ccctagcctg cctctcctgg ccactgcctc cggtcagcgt 1740
gtgtttcctg agcccacaga gagtggggac gaaggagagg agctgggcct tcccttgctc 1800
tccacgcgcc acgtccacct tgaatgtcgg cttcagctct ggtggtgtgg gggggggcca 1860
gactccagca tccctgatga tcaccagggc gagaaagggc agggaggaac aggagggagg 1920
tcgtgggggg cgtgatataa aanggtgttt gagtggctgt gactccttcc tacacagggc 1980
cctgataagc ctaggaatgc canagcccag ctgtagggtc ccagtccc 2028

<210> 123

<211> 2223

<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature 

<223> Incyte ID No: 3002667CB1 



<400> 123 

gcgcgcacgt ggggccgggg cggagagagg 
tggaatgggt gaatttaagg tccatcgagt 
ccgctgtgtg gcttacaata accagtcaaa 
tgtggaaatt tataacttgt cagcaaacta 
gtctcgggct acagaagctt tgtgctgggc 
caatggcgag attatggagt atgatttaca 
ctttggagga cctatttgga gcatggctgc 
ttgtgaagat ggatctgtga aactatttca 
aaattttgat cggcagaaaa gtcgcatcct 
cattgcagct ggttccatag actacattag 
tcataagatg attgtggaca ggcagtatat 
gggtgtcgcc ttcttgtccg atggcactat 
gttctgggac tcagccactg ggacgcttgt 
gcagtccatt gctgtagctg accaagaaga 
agtcttccat tttcagctgg tccctgtgac 
gacaaaaccg ttccagcatc acactcatga 
gctgatatct ggaggcactg acacccactt 
agtaaagaat tacgatgccg ctctccgaaa 
ctgttctaaa aagaggcagc ttctcctctt 
actgggatcc acagttgcaa caggcaagaa 
agatcattta ctgcacctaa agacaaaggg 
cccatgtgga agttggatag cctattctac 
ttatgaacat gacaacataa gcctcaaaag 
tgcccttcag attttgtttt ctgaagattc 
agctctgcat attgttcagc tgtcaggagg 
tcagtcagga acagtggagg ccatgtgtct 
agctgcatca ggtaccagtg ctggagtcca 
ctgcacggtg cctgcttaca atttcccagt 
caaccttgtc atcgctcatt cggaccagca 
gtatacagat tggagccgga ctgtccagaa 
ggatactcct atcacacaca tcagttttca 
tgatgcctac atgttctgca tcattgacaa 
actctacaat ccatttcctc ccacgaatga 
tgcttttaaa atttctaaga tatataagcc 
aacactcgtg gcagtagaac ggcctctgga 
taaaaagaag aaatttggaa cctaaaacag 
taccctgttg cttttcacaa atcatggtaa 
aaa 



cgagcaccgg gaaggggagc gtggggccgc 60 
acgtttcttt aattatgttc catcaggaat 120 
cagattggct gtttcacgaa cagatggcac 180 
ctttcaggag aaatttttcc caggtcatga 240 
agaaggacag cgactcttta gtgctgggct 300 
ggcgttaaac atcaagtatg ctatggatgc 360 
cagccccagt ggctctcaac ttttggttgg 420 
aattacccca gacaaaatcc agtttgaaag 480 
gagtctcagc tggcatccct ctggtaccca 540 
tgtgtttgat gtcaaatcag gcagcgctgt 600 
gggcgtgtct aagcggaagt gcatcgtgtg 660 
cataagtgtg gactctgctg ggaaggtgca 720 
gaagagccat ctcatcgcta atgctgacgt 780 
cagtttcgtg gtgggcacag ccgagggaac 840 
atctaacagc agtgagaagc agtgggtgcg 900 
cgtgcgcact gtggcccaca gcccaacagc 960 
agtctttcgt cctctcatgg agaaggtgga 1020 
aatcaccttt ccccaccgat gtctcatctc 1080 
ccagtttgct catcacttag aactttggcg 1140 
tggggatact cttccactct ctaaaaatgc 1200 
tcctgagaac attatctgta gctgtatctc 1260 
agtttctcgg ttttttctct atcggctgaa 1320 
ggtttccaaa atgccagcat tccttcgctc 1380 
aacaaagctc tttgtagcat caaatcaagg 1440 
aagcttcaag cacctgcatg ctttccagcc 1500 
tttggcagtc agtccagatg ggaattggct 1560 
tgtctacaac gtaaaacagc taaagcttca 1620 
gactgctatg gctattgccc ccaataccaa 1680 
ggtatttgag tacagcatcc cagacaaaca 1740 
gcagggcttt caccaccttt ggctccaaag 1800 
tcccaagaga ccgatgcaca tccttctcca 1860 
gtcattgccc cttccaaatg acaaaacctt 1920 
atcagatgtc atccggaggc gcacagctca 1980 
tctactcttc atggatcttt tggatgaaag 2040 
tgacatcatt gctcagctcc caccacccat 2100 
ggcactgtct gtgtccttcc ttgaactgtc 2160 
taaaacaagt tattcttgag gaaaaaaaaa 2220 

2223 



<210> 124 
<211> 728 
<212> DNA 
<220>

<221> misc_feature

<223> Incyte ID No: 3043734CB1 

<400> 124 

gcggcgtttc tggtggccag gcatcccggt cctcgcgcgt ggcgcagctc ccatcgccgg 60
accgacccat gtcgcgcccg cattgggtcc cgggaccccg gcgggagtgc cgcgtccgtc 120
ctttccagtc gccgggagtc tgagtcgcgg gccacgcggg agtggcggtg gagagcccgc 180
cggtcgttat gaggacggat ctaaaatgac cagcaaacgg aaaccttgcc aaacgcagct 240
caggagatcc atcagtgagc agttgcggga ctccacggcc agagcctggg atctgctgtg 300
gaagaacgtc cgggagaggc ggctggcaga aattgaggca aaagaagcat gtgactggct 360
ccgtgctgcc gggttcccgc aatacgctca gttatatgag gattcacaat ttcccatcaa 420
cattgtggct gtcaagaatg atcatgattt tcttgaaaag gaccttgtag aacctctttg 480
caggtaaacc atgtgaagta tttttgtttc tttccactgt tcagtctgca acaggcatca 540
ctatactgaa gggcgagctc agctattcgg caagtattca ctgagtgcct accatgtgcc 600
tgacccaggt gcaggttcta aatgtactac tgttaatgag catgatcagt ttgtgttttc 660
atggagctta aatcctagca ggggcctttg gacactagat taggaaaatg acagagaaag 720
aagagaga 728



<210> 125 
<211> 2161 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3294893CB1 



<400> 125 

gaggcggaag agcttctcgg ctctaggctc tggagtcccg ggagcagtga ggggccaccc 60
ggggcacagg aaagggccgc taggggaggg ccgggtgcac tcggggtgtc tgggccgcgg 120
gtctgaggga tgaggagggg ccatggccag cgacggggcc aggaagcaat tctggaagcg 180
cacaacagca agctcccggg cagcatccag cacgtgtatg gtgcccagca cccccccttt 240
gatccactgt tacatggcac tttgctcagg tccacggcca agatgccgac cacaccagtg 300
aaggccaaga gggtcagcac cttccaggag tttgagagca ataccagcga tgcctgggac 360
gctggggagg acgacgatga gctcctggcc atggcggcgg agagcctgaa ctccgaggtg 420
gtcatggaga cggccaaccg tgtgctgcgt aaccacagcc agcggcaggg gcggcccacg 480
ctgcaggagg ggccagggct tcagcagaag cccaggcccg aggcagagcc gccctcaccc 540
cccagcggcg acctccggct ggtgaagtcg gtcagtgaga gccacacgtc ctgtcctgca 600
gaaagtgcca gcgatgccgc ccctctgcag aggtcccagt ctctcccaca ctcggccacc 660
gtcacgctgg gtggcacatc tgaccccagc actctcagca gctcagcgct gagcgaaaga 720
gaggcctccc ggctcgacaa gttcaagcag ctgcttgccg gccccaacac ggaccttgag 780
gaattacgga ggttgagctg gtccggaatc cctaagccag tgcgtccaat gacgtggaag 840
ctcctctcag gttaccttcc cgccaatgta gaccggagac cagccactct ccagagaaaa 900
caaaaagaat attttgcatt tattgagcac tattacgatt ctaggaacga cgaagttcac 960
caggacacat acaggcagat ccacatagac atccctcgca tgagccctga agcgttgatc 1020
ctgcagccca aggtgacgga gatttttgaa aggatcttgt tcatatgggc gatccgccac 1080
ccagccagtg gatacgttca gggtataaat gatctcgtca ctcctttctt tgtggtcttc 1140
atttgtgaat acatagaggc agaggaggtg gacacggtgg acgtctccgg cgtgcccgca 1200
gaggtgctgt gcaacatcga ggccgacacc tactggtgca tgagcaagct gctggatggc 1260
attcaggaca actacacctt tgcccaacct gggattcaaa tgaaagtgaa aatgttagaa 1320
gaactcgtga gccggattga tgagcaagtg caccggcacc tggaccaaca cgaagtgaga 1380
tacctgcagt ttgccttccg ctggatgaac aacctgctga tgagggaggt gcccctgcgt 1440
tgtaccatcc gcctgtggga cacctaccag tctgaaccgg acggcttttc tcatttccac 1500
ttgtacgtgt gcgctgcttt tctcgtgaga tggaggaagg aaatactaga agaaaaagat 1560
tttcaagagc tgctgctctt cctccagaac ctgcccacag cccactggga tgatgaggac 1620
atcagcctgt tgctggccga ggcctaccgc ctcaagtttg cttttgccga cgcccccaat 1680
cactacaaga aatgagccca ggcccacccg cagctggcct cactgtcccg ggtggcgcgc 1740
cccacctgcc tggctggtgg taggcccctg tgagctggtc ccgggctgct aaaaggcctt 1800
gtgaggtggc cccaccctcc aggggagctg gtgaagatgg gccacagacc tggtctaggg 1860
ctgacaaaga cagggacagc ctttgttttc tgagatacca aagagagcca ggggagggcc 1920
ccgggttcgg cggccagagg caggtcaggg gtcccctctc cctctccctg caatgtcctt 1980
gccaaatgac tgcctcctgc tgcccctagt ccggggcagc ctaggaggcc caccctcttt 2040
ggagtcctgc tgtctgggtg ccagggccgg aacgaggtag tggccatctc atacctactc 2100
tgaaatgcaa aacttctatt ctgttgagtg aaaaaataaa atgtagacaa aaaaaaaaaa 2160
c 2161



<210> 126 

<211> 2782 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3349052CB1 

<400> 126 

attagctgcc ggcgtgactt tgaccgcttc ccggtgcgtt accggcagct gaacccaccc 60 
ggcgtcacgg gactttgacg cgtgctctgc gcttgccatg agactcctgg gagccgcagc 120
cgtcgcggct ctggggcgcg gaagggcccc cgcctcccta ggctggcaga ggaagcaggt 180 
taattggaag gcctgccgat ggtcttcatc aggggtgatt cctaatgaaa aaatacgaaa 240 
tattggaatc tcagctcaca ttgattctgg gaaaactaca ttaacagaac gagtccttta 300 
ctacactggc agaattgcaa agatgcatga ggtgaaaggt aaagatggag ttggtgctgt 360 
catggattcc atggaactag agagacaaag aggaatcact attcagtcag cagccactta 420 
caccatgtgg aaagatgtca atattaacat tatagatact cctgggcatg tggacttcac 480 
aatagaagtg gaaagggccc tgagagtgtt ggatggtgca gtccttgttc tctgtgctgt 540 
tggaggggta cagtgccaga ccatgactgt caatcgtcag atgaagcgct acaacgttcc 600 
gtttctaact tttattaaca aattggaccg aatgggctcc aacccagcca gggccctgca 660 
gcaaatgagg tctaaactaa atcataatgc agcgtttatg cagataccca tgggtttgga 720 
gggtaatttt aaaggtatta tagatcttat tgaggaacga gccatctatt ttgatggaga 780 
ctttggtcag attgttcgat atggtgagat tccagctgaa ttaagggcgg cggccactga 840 
ccaccggcag gagctaattg aatgtgttgc caattcagat gaacagcttg gtgagatgtt 900 
tctggaagaa aaaatcccct cgatttctga tttaaagcta gcaattcgaa gagctactct 960 
gaaaagatca tttactcctg tatttttggg aagcgccttg aagaacaaag gagttcagcc 1020 
tcttttagat gctgttttag aatacctccc aaatccatct gaagtccaga actatgctat 1080 
tctcaataaa gaggatgact caaaagagaa aaccaaaatc ctaatgaact ccagtagaga 1140 
caattcccac ccatttgtag gcctggcttt taaactggag gtaggtcgat ttggacaatt 1200 
aacttatgtt cgcagttatc agggagagct aaagaagggt gacaccatct ataacacaag 1260 
gacaagaaag aaagtacggt tgcaacggct ggctcgcatg catgccgaca tgatggagga 1320 
tgttgaggaa gtatatgccg gagacatctg tgcattgttt ggcattgact gtgctagtgg 1380 
agacacattc acagacaaag ccaacagcgg cctttctatg gagtcaattc atgttcctga 1440 
tcctgtcatt tcaatagcaa tgaagccttc taacaagaac gatctggaaa aattttcaaa 1500 
aggtattggc aggtttacaa gagaagatcc cacatttaaa gtatactttg acactgagaa 1560 
caaagagaca gttatatctg gaatgggaga attacacctg gaaatctatg ctcagaggct 1620 
ggaaagagag tatggctgtc cttgtatcac aggaaagcca aaagttgcct ttcgagagac 1680 
cattactgcc cctgtcccgt ttgactttac acataaaaaa caatcaggtg gtgcaggcca 1740 
gtatggaaaa gtaataggtg tcctggagcc tctggaccca gaggactaca ctaaattgga 1800 
attttcagat gaaacattcg gatcaaatat tccaaagcag tttgtgcctg ctgtagaaaa 1860 
ggggttttta gatgcctgcg agaagggccc tctttctggt cacaagctct ctgggctccg 1920 
gtttgtcctg caagatggag cacaccacat ggttgattct aatgaaatct ctttcatccg 1980 
agcaggagaa ggtgctctta aacaagcctt ggcaaatgca acattatgta ttcttgaacc 2040 
tattatggct gtggaagttg tagctccaaa tgaatttcag ggacaagtaa ttgcaggaat 2100 
taaccgacgc catggggtaa tcactgggca agatggagtt gaggactatt ttacactgta 2160 
tgcagatgtc cctctaaatg atatgtttgg ttattccact gaacttaggt catgcacaga 2220 
gggaaaggga gaatacacaa tggagtatag caggtatcag ccatgtttac catccacaca 2280 
agaagacgtc attaataagt atttggaagc tacaggtcaa cttcctgtta aaaaaggaaa 2340 
agccaagaac taactttgtt tactgtgagt tgactgactc taattgaatc tgcgtggttt 2400 
tgatactttg atggattcca gtggaataaa ttcaggctgc tgaaacaaga aattctgagc 2460 
ccaggaagcg ggctcttctt tcttcaaaag aagcccttct tgttcatatt caggagcttc 2520 
tgttatattc aaaggtaatt ctatgtctat ctcaactcta ttgattggtt ttatagttta 2580 
ttgaaaatcc tcaaataaaa tataattatt actgaaatat gtttaatatt taaggggaaa 2640
agagactaat ttcagttata cttttaagct tagaatgtat gttcatttcc aaattttgta 2700 
tcataagagt tttcaacata gagaaaagct gaaaaaatgc aaagaataac cacatacttt 2760 
ccatctacct tcctttggta ac 2782 
<210> 127

<211> 3019 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3357264CB1 

<220> 

<221> unsure 
<222> 985 

<223> a, t, c, g, or other 
<400> 127 

tggctgggtc cgcgggcggg ggaaggtgtc 
caggcccatc ctgtggtagg ccgtcccagg 
cgacatccca tcctagcccc gcgatgtgcg 
gacgtaaatt gcagaatcaa agagctgctt
tgaggaccgg agcggaaaac cttctgaaag
tgcggctgga gctgagcttc gtcaactcag
ggctgaacat ctcggtgggc gtctatcaga 
ttcctcttgg cctgaaggaa acgaaagacg 
tcctggaaca ttacagtgaa gatggctatt 
atctgagaca agcttgtcgg acgcctagcc 
catacttcat ccagctgggc tttgtcgaga 
gactcctgtt cacctggtat gactctctca 
tgctggagaa ggccagtgtc ctgttcaaca 
ggtgtgatcg gcagacgcag gctgggctgg 
caggggtttt aaattacctg aaagacacat 
ctgccatgct cagcgtgctc gtcaaaatga 
agaaaatcag ccttcctggg atccngaatg 
aggctgctaa ggtgggagag gtctaccaac 
tgaaagagaa catcccctac tcctgggcca 
cggccctggc ccactacttc actgccatcc 
cggatctgga ccaccaggag aagtgcctgt 
tgacaccctt ggccacactg aagaatgatc 
tgcgcagagc catggctcat cacgaggagt 
tgcggacgat tgaggtgcta cagaaggtgc 
cgtacgccca gcaccaggag gaggatgacc 
ttgctaaaac tgagcaagag gttgacatta 
cggacttctt ccagaagctg ggccccttat 
ctcctcgaag catccgcttc actgcagaag 
acgcccccgt tcaggttcac ttcctggatc 
gggaaggaga ttatattgtc tccattcagc 
aggttatgaa gctgctgaag agctttggcg 
tcctggactc cacatcatcc atgcataata 
aaacgtactc catgatctgc ttagccattg 
aaatctccaa gaagctttcc ttcctgagtt 
ccagcacctt gtgcctccca tcggtcgggg 
cctccccttt cagccttctc aactcagaca 
tgttcaggcc ccgaacattt ccggtgctga 
aaatatctat ctatctgttc tcaaatcctg 
gtgtttttat gaaggaaagt aaccaagaaa 
ttgaggtgtg ttactatact gctgtaagtt 
agtgttgaag atatgaaata tggctatttt 
attaaattgg ggttacctat atcagtacaa 
tcattcctta acctagaatt cagatgatcc 
gatagaatta aattagtgtc actaaaaact 
cattaaccta aaacaagatg ttactattat 
aaagtctagt gccaaatttg tcttattaaa 
agtttgatgt catttaagaa ttactaacac 
caaggtagat aatatactgt ttgatactca 



ctagcggccc gagcctgcgc tccggattct 60 
caggagttgc ctcggaggat ttggcagcca 120 
gggctgtaat ccccttgcac aaaccggccg 180 
tgaatcagca gatcctgaaa gccgtgcgga 240 
tggccacaaa ctcaaaggtg cgggagcaag 300 
acctgcagat gctcaaggaa gagctggagg 360
acacagagga ggcatttacg attcccctga 420 
tcgactttgc agtcgtcctc aaggatttta 480 
tatatgaaga tgaaattgca gatcttatgg 540 
gggatgaggc cggggtggaa ctgctgatga 600 
gtcgattctt cccgcccaca cggcagatgg 660 
ccggggttcc ggtcagccag cagaacctgc 720 
ctggggccct ctacacccag attgggaccc 780 
agagtgccat agatgccttt cagagagccg 840 
ttacccatac tccaagttac gacatgagcc 900 
tgcttgcaca agcccaagaa agcgtgtttg 960 
aattcttcat gctggtgaag gtggctcagg 1020 
agctacacgc agccatgagc caggcgccgg 1080 
gcttagcctg cgtgaaggcc caccactacg 1140 
tcctcatcga ccaccaggtg aagccaggca 1200 
cccagctcta cgaccacatg ccagaggggc 1260 
agcagcgccg acagctgggg aagtcccact 1320 
cggtgcggga ggcgagcctc tgcaagaagc 1380 
tgtgtgccgc acaggaacgc tcccggctca 1440 
tgctgaacct gatcgacgcc cccagtgttg 1500 
tattgcccca gttctccaag ctgacagtca 1560 
ctgtgttttc ggctaacaag cggtggacgc 1620 
aaggggactt ggggttcacc ttgagaggga 1680 
cttactgctc tgcctcggtg gcaggagccc 1740 
ttgtggattg taagtggctg acgctgagtg 1800 
aggacgagat cgagatgaaa gtcgtgagcc 1860 
agagtgccac atactccgtg ggaatgcaga 1920 
atgatgacga caaaactgat aaaaccaaga 1980 
ggggcaccaa caagaacaga cagaagtcag 2040 
ctgcacggcc tcaggtcaag aagaagctgc 2100 
gttcttggta ctaatgtgag gaaacaaaca 2160 
ctcggcctta aacgtttgtg ccataatgga 2220 
tttttctcat agtgtaaact cacatttgat 2280 
cctctaggaa ttagtgaaaa aagaactttt 2340 
atttattata taaagtattg taaatagaat 2400 
taatggtgac aattatgact tttagtcact 2460 
tttgtagttg tttccaggtt tggctaataa 2520 
tggaattaag gcaggtcaga ggactgtaat 2580 
gtcccaaagt gctgcttcct aataggaatt 2640 
atcgatagac tatgaatgct atttctagaa 2700 
taaaaacaat gtaggagcag cttttcttct 2760 
agtggcagtg ttagatgaag atgctgtcta 2820 
aaacattttt cattttgttt aaagtagaag 2880 
ttacataatt ctatatttta agtcttgggt aaaaaagtag ttttacattt tataaagtaa 2940
agatgtaaat gattcagctt taaagctcta tttgacttcc ttcttttgtc tgagatagcg 3000 
tccagactgc gaaaagcga 3019 

<210> 128 

<211> 2312 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3576329CB1

<400> 128 

gccggcgcgc ggtggggcat ggcgggttcg cggggtgcgg ggcgcacggc ggcgccgagc 60 
gtgcggccgg agaagcggcg gtctgagccc gaactggagc ctgagcccga gccggagccc 120 
cccctcctct gcacctctcc tctcagccac agcaccggca gcgattctgg cgtctccgac 180 
agcgaggaga gtgtgttctc aggcctggaa gattccggca gtgacagcag tgaggatgat 240 
gacgaaggcg acgaggaggg agaggacgga gcccttgatg acgagggcca cagtgggatt 300 
aaaaagacca ctgaggagca ggtgcaggcc agcactcctt gcccgaggac agagatggcg 360
agcgcccgga ttggggatga gtatgcggag gacagctctg atgaggagga catccggaac 420 
acggtgggca acgtgccctt ggagtggtac gatgacttcc cccacgtggg ctacgacctg 480 
gatggcaggc gcatctacaa gcccctgcgg acccgggatg agctggacca gttcctggac 540 
aagatggacg atcctgacta ctggcgcacc gtgcaggacc cgatgacagg gcgggacctg 600 
agactgacgg atgagcaggt ggccctggtg cggcggctgc agagtggcca gtttggggat 660 
gtgggcttca acccctatga gccggctgtc gacttcttca gcggggacgt catgatccac 720
ccggtgacca accgcccggc cgacaagcgc agcttcatcc cctccctggt ggagaaggag 780 
aaggtctctc gcatggtgca cgccatcaag atgggctgga tccagcctcg ccggccccga 840 
gaccccaccc ccagcttcta tgacctgtgg gcccaggagg accccaacgc cgtgctcggg 900 
cgccacaaga tgcacgtacc tgctcccaag ctggccctgc caggccacgc cgagtcgtac 960 
aacccacccc ctgaatacct gctcagcgag gaggagcgct tggcgtggga acagcaggag 1020 
ccaggcgaga ggaagctggg ctttttgcca cgcaagttcc cgagcctgcg ggccgtgcct 1080 
gcctacggac gcttcatcca ggaacgcttc gagcgctgcc ttgacctgta cctgtgccca 1140 
cggcagcgca agatgagggt gaatgtagac cctgaggacc tcatccccaa gctgcctcgg 1200 
ccgagggacc tgcagccctt ccccacgtgc caggccctgg tctacagggg ccacagtgac 1260
cttgtccggt gcctcagtgt ctctcctggg ggccagtggc tggtttcagg ctctgacgac 1320 
ggctccctgc ggctctggga ggtggccact gcccgctgtg tgaggactgt tcccgtgggg 1380
ggcgtggtga agagtgtggc ctggaacccc agccccgctg tctgcctggt ggctgcagcc 1440 
gtggaggact cggtgctgct gctgaaccca gctctggggg accggctggt ggcgggcagc 1500 
acagatcagc tgttgagcgc cttcgtcccg cctgaggagc cccccttgca gccggcccgc 1560 
tggctggagg cctcagagga ggagcgccaa gtgggcctgc ggctgcgcat ctgccacggg 1620
aagccagtga cgcaggtgac ctggcacggg cgtggggact acctggccgt ggtgctggcc 1680 
acccaaggcc acacccaggt gctgattcac cagctgagcc gtcgccgcag ccagagtccg 1740 
ttccgccgca gccacggaca ggtgcagcga gtggccttcc accctgcccg gcccttcctg 1800 
ttggtggcgt cccagcgcag cgtccgcctc taccacctgc tgcgccagga gctcaccaag 1860 
aagctgatgc ccaactgcaa gtgggtgtcc agcctggcgg tgcaccctgc aggtgacaac 1920
gtcatctgtg ggagctacga tagcaagctg gtgtggtttg acctggatct ttccaccaag 1980 
ccatacagga tgctgagaca ccacaagaag gctctgcggg ctgtggcctt ccacccgcgg 2040 
tacccactct ttgcgtcagg ctcggacgac ggcagtgtca tcgtctgcca tggcatggtg 2100 
tacaatgacc ttctgcagaa ccccttgctg gtgcccgtca aggtgctgaa gggacacgtg 2160 
ctgacccgag atctgggagt gctggacgtc atcttccacc ccacccagcc gtgggtcttc 2220 
tcctcggggg cagacgggac tgtccgcctc ttcacctagc tgttctgcct gcctggggct 2280 
ggggtggtcg tgctgaagtc aacagagcct tc 2312 

<210> 129 
<211> 921 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 3805550CB1 
<400> 129

aggcggagtc ggggcggtgt gctgaggtgg gcctgagggc ggagtcgagg tcgggctgaa 60 
ggcggagtcg gggagggctg aggtgggcct gaaggcagag tcgaggccat ggcagggccg 120 
ggcccaggcc cgggggaccc ggacgagcag tacgatttcc tgttcaagct ggtgctggtg 180 
ggcgacgcaa gcgtgggcaa gacgtgcgtg gtgcagcgct tcaagaccgg cgccttctcg 240 
gagcgccagg gaagcaccat cggcgtcgac ttcaccatga agacgctgga gatccagggc 300 
aagcgggtca agctgcagat ctgggacacg gccggccagg agcggttccg caccatcacc 360 
cagagctact accgcagtgc caatggggcc atccttgcct acgacatcac caagaggagc 420
tccttcctgt cggtgcctca ctggattgag gatgtgagga agtatgcggg ctccaacatt 480 
gtgcagctgc tgatcgggaa caagtcagac ctcagcgagc ttcgggaggt ctccttggct 540 
gaggcacaga gcctggctga gcactatgac atcctgtgtg ccattgagac gtctgccaag 600 
gactcgagca acgtggagga ggccttcctg agggtggcca cggagctcat catgcggcac 660 
gggggcccct tgttcagcga gaagagcccc gaccacatcc agctgaacag caaggacatc 720 
ggagaaggct ggggctgcgg gtgctgacca ggggccgggc cggcagactg ggggttcccc 780 
acctccttgc tctccccagc ctgccaagcc cagccctcca gagccagccc tcctgggtac 840 
cggcaactac agcagccggg tgaagctctg gagctctgca tcctgtggcc tggctgcggg 900 
atggaggctc tccttgagga a 921 

<210> 130 

<211> 1291 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4546403CB1 

<400> 130 

ctcgagcgaa tcggctcgag 
agcttctttg tcactggcac 
tgtctgcata gtgaaaaagc 
ccagtttctg atggagaaca 
tgccaagtca aaatttggat 
aaaagtacac tgagtgggca 
cagatgctag tctcagggtc 
aatatacttc acacattgac 
aatacccttt tacttgctac 
ctggaaacac tttgccaagc 
tggtcagagg aggatgtctc 
attttcaaga tgaataacat 
gctgatgatt tgaaaattga 
gagctcagga ccaaggttaa 
ataactagag aacttatgaa 
gaagcaatgg aaaattggat 
cttccttcag cggtacttac 
gagacacacc aaaagtaaaa 
tgaatgattt ataggtaaat 
aacttcttaa atttagttac 
acttactata aaagcctttt 
ttatacttta aattgcaaaa 

<210> 131 

<211> 1836 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 4767318CB1 

<400> 131 

ttttagaagg ttagtgttgg ttcttttatt cgattaaaca ggaatacaca tatgtctacc 60 
aaagaatagg taagggagaa ataagaacac taaaaaaact cggaatcgtt aagtgtgaag 120 



agatggctcc ttggcggcat 
ttcatgtggt gatttaacag 
acatgatctt ggaattacct 
aggtcttcag ttttttcgac 
tgtttctttt acccatatct 
ctgtgctcct gttctggctt 
agtggataag tctgtcatag 
tcagcacacc aggtatgtca 
tggttcaatg gacaaaacag 
aaggagcaca gaacatcagc 
aacatggctt tgtgcacaag 
tgatggaaaa gaactgttga 
atctctagga ctgcgtagta 
atccctttct tcaggaattc 
agatccggtc atcgcatcag 
cagcaaaaag aaacgtacaa 
accaaatagg actctgaaaa 
ttgttgatat tgtattattt 
actaatcaga cattattaaa 
ctataaaaat tgtcaatttt 
tgtactagtg aaaagaatct 
aaaaaaaaaa a 



gtgcattttc tcctaatgga 60 
tgtgggatga tcaaatgagg 120 
gctgcgattt ttcttcacag 180 
tggcatcatg tggtcaggat 240
taggttttga attaaaatat 300 
gtgctttttc ccatgatggg 360 
tatatgatac taatactgag 420 
caacttgtgc ttttgcacct 480 
tgaacatctg gcaatttgac 540 
tgaagcaatt taccgaagat 600 
atttaaaaga tcttgttggt 660 
atcttacaaa agaaagtctg 720 
aagtgctgag gaaaattgaa 780 
ctgatgaatt tatatgtcca 840 
atggctattc atatgaaaag 900 
gtcccatgac aaatcttgtt 960 
tggccatcaa tagatggctg 1020 
atattttcag tgatctcatt 1080 
agcaaaacag gaaaaaggta 1140
cattctttaa aaacacatgg 1200 
tcagctatat agaaataaag 1260 

1291 
catatttgga gttaaaagaa ccaaatatta
taccgggatt tgtagtccct tccggggcgg 
gctcttccgg gcgcatgcgt gcggcagcgg 
tgcagtgttg tgagttggaa gctggggagc 
ggccctcggc gttgggccag agcggccccg 
gcggcccgtc gcgctacgtg cttgggatgc 
gcgagttcct ggcgcacagc gccaaggtgc 
gcctagcctc ggggtccttc gacaagacgg 
tggtcaaaga aaacaattat cggggacatg 
caagtaatcc tgacctattt gttacggcgt 
tgaggactac aaaatgcatt gccactgtga 
ggagtcctga tgggcagacc attgctgtag 
atgccaagac acaccgttcc aaagcagaag 
cctggaacaa tgacaataat atgttcttcc 
tcagctaccc agaactgaag cctgtgcagt 
gtatcaagtt tgaccccatg gggaagtact 
gcctctggga tgtggatgag ttagtgtgtg 
taagaaccct cagtttcagc catgatggga 
ttattgacat tgctgaagtg gagacagggg 
cgaccttcac agtggcgtgg caccccaaaa 
aagacggcaa atatgacagc agccgggaag 
atgattcttg agaggaggtt gtagggagag 
gttagtttgg tctgttctct cggagttggt 
aattgtaaac gtctctggtc aggctgcgca 
ctctttccat tctttgcccc cagcatgagt 
tgaccccagc ataccttgtc ctctggacct 
gactttgtgg catttgatac aacttggaca 
ggcatggaag tttttttaaa taattaaaaa 
ttaataaact gacttttgtt attatggaaa 



ctaagtaagc agacgcgggc acgcgctgca 180 
ggtacagcgc gcctgcgcag aggggccgtc 240 
cgccaggact gactgcgccg tggaggctgc 300 
tcggcatggc ggtccccgct gcagccatgg 360 
gctcgatggc cccgtggtgc tcagtgagca 420
aggagctgtt ccggggccac agcaagacgc 480 
actcggtggc ctggagttgc gacgggcgtc 540 
ccagcgtctt cttgctggag aaggaccggt 600 
gggatagtgt ggaccagctt tgttggcatc 660 
ctggagataa aaccattcgc atctgggatg 720 
acactaaagg ggagaacatt aatatctgct 780 
gcaacaagga tgatgtggtg acctttattg 840 
agcagttcaa gttcgaggtc aacgaaatct 900 
tgacaaatgg caatggttgt atcaacatcc 960 
ccatcaacgc ccatccttcc aactgcatct 1020 
ttgccacagg aagtgcggat gctttggtca 1080 
ttcggtgctt ttccaggctg gattggcctg 1140 
aaatgctggc gtcagcatcg gaagatcatt 1200 
acaaactatg ggaggtacag tgtgagtctc 1260 
ggcctctgct ggcatttgcc tgtgatgaca 1320
ccggaactgt gaagctgttt gggcttccta 1380 
gaggccccgg cagaggtctt ccttcatgtg 1440 
gggcacccta aatatttgta agttggtata 1500 
tttcgttctt ttgctttgtc tgtgtattag 1560 
taactcgcgt ggactctgca gtgcgagtag 1620 
cctgtcttct ctgcttctgg gtgcatggta 1680 
atacctagtt tggagggagg ggaatggaag 1740 
tatatacata taattttgag aattgagcat 1800 
aaaaaa 1836



<210> 132 

<211> 2136 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4834527CB1 



<400> 132 

ggcgcgccgg 

aggcagcctc 

cctggatgga 

agccgactac 

catcgagccc 

ggacaagccg 

gttggccgag 

ccttaaggac 

tcagctccta 

ggtgggtgcc 

gctccttggt

ggagcggctg 

tctggccagc 

atggttcaac 

gcagagggcc 

caacttcaac 

gaagaagacc 

cccaacgggg 

gacggcccac 

catctacttc 

gaaattcctg 

tcccttcgag 

ggatggtctt 



gagccggcag 
agcccacccc 
gcgccatcct 
taccccgaga 
cgggagctcc 
gtgctggaca 
tggaccgaga 
gtcgtgggcc 
caggctctgc 
gacaagccca 
gtctgcagcg 
cggcacatcg 
acaaagccct 
aggctgtgct 
caggtgattg 
tccctcatgg 
tgggccaaag 
aatttctgca 
agcagccgag 
ctgaatgagg 
gagctggcca 
caagacgcca 
tatttggctt 



acatgccaca 
ccaccgagcc 
cagcctccct 
aagcctacat 
tggcccgggt 
aggcccgggt 
ccttcccaag 
gcatcgcccc 
accagaagct 
tctcctacag 
acccctacac 
ggcctgagga 
gcttcagtga 
acctggtggc 
agttcttcat 
ccatcatctc 
tgaggacggc 
actacaggac 
agaagattgt 
gctgcgccaa 
agcaggtggg 
gcatcaccca 
cttatgaaag 



gacgctgagt 
cacagatggc 
ggaaacactg 
cttcaccttc 
ctgccacctg 
ccggaagttc 
ggacttccag 
ctgtgacgag 
ggcggctctg 
gaccaagcca 
actggcccag 
gtttgtccag 
caagaccagc 
aactgagatc 
cgacgtggcc 
cggcatgaac 
caagtttttc 
agccctgcgc 
cattcctttc 
ccgccttccc 
ggagttcatc 
ctacctgtac 
tgagagccca 



gcctccgaca 
gaacaggctg 
atccagcacc 
ctgctgagct 
tgcatcgagc 
ggccccaaac 
gaagagtcga 
gcataccgga 
cgccaggggc 
ccagcctcca 
cagctgaccc 
gcctttgtga 
aacctggagg 
tgcatgccag 
cgcgagtgct 
atgagccctg 
atcctcgagc 
ggggcggccc 
ttcagcctgc 
aatggacacg 
acctggaaac 
accgccccca 
gagaaccaaa 



tggtcacccc 
ggcagcccct 
tggtgcccac 
ctcgcctctt 
agcagcagct 
tgctgcagct 
ctatcgggca 
agaggatgca 
cagaaggtct 
tccacaggga 
acgtggaact 
acaaggaccc 
cttatgtgaa 
ccaagaagaa 
tcaacatcgg 
tctccaggct 
accagatgga 
accgctccct 
tcatcaaaga 
tcaactttga 
aagtggagtg 
tcttcagtga 
cagaaaaaga 
aagatggaaa gctctaagat cttctatttt
acgaggaaga gctggagccc gcagaagccg 
cagaggccag ggagtgcctc actattttgc 
ccccgccccc cacagtggcc atacgggcac 
caggacccag ggcctccaga cgtgcgggcg 
gaccgtgggg cctggtcagt ctgtccatcc 
tgggaccact gcaagctcga gaccttgctt 
cagtctgtca ccacgtggct tgggaacttc 
tgagatgcct gcgccagccc cgagcccact 
tcaccctgcc ctgccgacga gcttgtctct 
gctggcccca ggagatagct ttccggtgac 
gacacatgga gttacagtgt gtacacggca 
agtcactata aacacacccg tacgcctaaa 



ggggaagaca tgaaagcgct gagctgaggg 1440 
tccacagccc tgcctcagtg gcccagtggg 1500 
aaatgccgac cctgtggcct gctgcccgcc 1560 
aggagacctt ttatgggact ttggccctgg 1620 
gcacatgcct tggggacatc ctgccttcag 1680 
tcggcaagga cacaacactg ccccagaggg 1740 
ggtgacatgt gccactttgg ccaccaccca 1800 
tggagccaca gcaggcatca cggtgcgacg 1860 
ggcagccact gccattccac ccatggtccc 1920 
gcagccccag gtaccccctt cctggatgct 1980 
agctgtggaa cgcgtcagca ggacaaactg 2040 
gtcccgccac ccagccccct tgtaaactct 2100 
aaaaaa 2136